spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported
76 Subscribers
Help out
- Issues
- Prototype: support eager analysis inside Pipelines query functions
- [SPARK-54928] Add shuffle migration statistics to MigrationInfo
- [SPARK-54947][CORE] Refactor block mapping with BlockInfoGroup for better block management
- [SPARK-54974][SQL] Always propagate static confs to views/UDFs from active session
- [DRAFT][SQL][PYTHON][CONNECT] Enable xxhash64 with seed parameter
- [SPARK-54978][CORE] Change log level to warning in UnifiedMemoryManager
- [SPARK-54921][CORE] Add parentIds in StageData
- [SPARK-45414][SQL][TESTS] Add regression tests for XML mixed type serialization
- [SPARK-55107][SQL] Log TID for scanned file in FileScanRDD
- [SPARK-55120][K8S] Change driver/executor uuid to be val instead of def
- Docs
- Scala not yet supported