spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported
76 Subscribers
Help out
- Issues
- [SPARK-46367][SQL] Support narrowing projection of `KeyedPartitioning` in `PartitioningPreservingUnaryExecNode`
- [WIP][SPARK-56608][PYTHON] Migrate grouped/cogrouped map Arrow UDF verify checks into enforce_schema
- Support Catalog Store
- [SPARK-56594][SQL] Add time_bucket scalar function
- [SPARK-54022][SPARK-56617][SQL][TESTS] Add more CACHE TABLE tests and reorganize CACHE TABLE tests
- [SPARK-56612][PYTHON] Unify verify_result and container-type checks into verify_return_type helper
- [SPARK-56568][SQL] Add additional column ID test coverage for DSv2 Column.id()
- [SPARK-54119] Support METRIC_VIEW creation on V2 catalogs
- [SPARK-56572][SDP] Inject Spark session into Python files
- [SPARK-56589][CORE] Use Virtual Threads for BlockManagerMasterEndpoint ask thread pool
- Docs
- Scala not yet supported