spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported76 Subscribers
View all SubscribersAdd a CodeTriage badge to spark
Help out
- Issues
- [WIP][SPARK-56479][SQL] df.cache() with DSv2 interfaces to enable V2ScanRelationPushDown optimizer rules
- [SPARK-56227][CORE] Fix GcmTransportCipher to correctly handle multiple messages per channel
- [SPARK-56175][SQL] FileTable implements SupportsPartitionManagement and V2 catalog table loading
- [SPARK-38101][CORE] Fix executors failing fetching map statuses with INTERNAL_ERROR_BROADCAST
- AES-GCM for RPC encryption does not work on YARN
- [SPARK-56200][CORE] Remove jackson JSON nesting depth limitation
- [SPARK-56199][CORE] Read fallback storage blocks asynchronously and multithread
- [SPARK-56171][SQL] Enable V2 file write path for non-partitioned DataFrame API writes and delete `FallBackFileSourceV2`
- [SPARK-55330][K8S] Make spark.kubernetes.legacy.useReadWriteOnceAccessMode a public config
- [SPARK-56160][SQL] Add DataType classes for nanosecond timestamp types
- Docs
- Scala not yet supported