spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported76 Subscribers
View all SubscribersAdd a CodeTriage badge to spark
Help out
- Issues
- [SPARK-56745][SQL] Cache foldable ZoneId in ConvertTimezone to avoid per-row lookup
- [SPARK-56482][SQL][4.3] Enable whole-stage codegen fusion for `UnionExec`
- [SPARK-56482][SQL][4.2] Enable whole-stage codegen fusion for `UnionExec`
- [SPARK-56768][PYTHON][INFRA] Share SBT compile artifact across pyspark CI jobs
- [SPARK-28587][SQL] Route JDBC partition bound literals through JdbcDialect.compileValue
- Bump io.netty:netty-transport-native-epoll from 4.2.12.Final to 4.2.13.Final
- [MINOR][INFRA] Ignore AGENTS.md and CONTRIBUTING.md in determine_modules_for_files
- [Backport 4.x][SPARK-56324] Add ZeroCopyByteStream to enable PySpark <-> Spark message-based communication
- [SPARK-56448][CONNECT] Fix NPE on Spark Connect client restart due to ammonite compile cache
- [SPARK-56756][SQL] Add error class for recursiveFileLookup conflict with partitioned data source
- Docs
- Scala not yet supported