spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported75 Subscribers
View all SubscribersAdd a CodeTriage badge to spark
Help out
- Issues
- [SPARK-57599][SQL] Decode Variant strings and keys as UTF-8
- [SPARK-57604][BUILD] Upgrade log4j to 2.26.0 to fix JDK 25 ThrowableStackTraceRenderer NPE
- [SPARK-56539][CORE] Fix exitStatusCode 0 for unrecognized spark-submit options
- [SPARK-57593][SQL][PYTHON] Byte-bound and instrument the pickle Python UDF input batch
- [SPARK-57532][CORE][TESTS] Add a test suite for StringSubstitutor and tidy its default-value length bookkeeping
- [SPARK-57600][SQL] Declarative Pipelines should isolate per-flow SQL confs during parallel flow resolution
- [SPARK-57595][PYTHON] Declarative Pipelines analysis context should not mask the original error when registration fails
- [WIP][SPARK-55444][SQL] Introduce and Route TimeType to Parquet vectorized read through the Types Framework
- [SPARK-57597][CORE] Guard ByteUnit.toBytes against long overflow
- [SPARK-57555][SQL] Support TIME data type in built-in JDBC data source
- Docs
- Scala not yet supported