spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported76 Subscribers
View all SubscribersAdd a CodeTriage badge to spark
Help out
- Issues
- [SPARK-55978][SQL] Add TABLESAMPLE SYSTEM block sampling with DSv2 pushdown
- [SPARK-56015][INFRA][DOCS] Cleanup docs container, remove unused R deps, and fix x86 build.
- [WIP][DO-NOT-REVIEW][SPARK-55886][SQL] Add `DataFrame.zip` for merging column-projected DataFrames
- [DOCS] Document return types for aggregate functions (stddev, variance, etc.)
- [MINOR] Fix typo errors in code comments across multiple modules
- [SPARK-53440][PYTHON] Allow Column.transform() to accept SQL lambda expression strings
- [MINOR] Add quotes to fix erroneous pip install command in SDP Prog Guide
- [SPARK-56252][HISTORY] History server disk store lease should remove temp path delete hook after commit and rollback to avoid memory leak
- [SPARK-56147][SQL] `spark-sql` cli correctly handles SQL Scripting compound blocks
- [SPARK-56152][SQL] Enable implicit cast from STRING to TIME type
- Docs
- Scala not yet supported