spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported76 Subscribers
View all SubscribersAdd a CodeTriage badge to spark
Help out
- Issues
- [SPARK-43829][CONNECT] Improve SparkConnectPlanner by reuse Dataset and avoid construct new Dataset
- [SPARK-45579][CORE] Catch errors for FallbackStorage.copy
- [SPARK-45373][SQL] Minimize partitions fetch call to HiveMetaStoreLayer
- [TEST ONLY][SQL] Test resolve column references with PLAN_ID
- [SPARK-22876][YARN] Respect YARN AM failure validity interval
- [WIP][SPARK-24815] [CORE] Trigger Interval based DRA for Structured Streaming
- [SPARK-44639][SS][YARN] Use Java tmp dir for local RocksDB state storage on Yarn
- [SPARK-44635][CORE] Handle shuffle fetch failures in decommissions
- [SPARK-44609][K8S] Remove executor pod from PodsAllocator if it was removed from scheduler backend
- [SPARK-44571][SQL] Eliminate the Join by combine multiple Aggregates
- Docs
- Scala not yet supported