spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported75 Subscribers
View all SubscribersAdd a CodeTriage badge to spark
Help out
- Issues
- [SPARK-56335][SQL] Implement SupportsMetadataColumns in FileTable
- [SPARK-56371][SQL] Support _metadata.row_index for V2 Parquet reads
- [SPARK-52669][PYSPARK] Fix Python executable selection for YARN client mode #51357
- [SPARK-53848][SQL] Add ability to support Alpha family in Theta Aggregates
- [SPARK-56374][BUILD] Align SBT assembly shade rules with Maven
- Can't install prerelease using pip because version check doesn't like the version number.
- [SPARK-56388][CONNECT] Add XML support to Spark Connect Parse protocol
- [SPARK-56451][DOCS][SDP] Document how SDP datasets are stored and refreshed
- [SPARK-56375][PS] Implement DataFrame.set_axis and Series.set_axis
- Implement `SET` command in single-pass analyzer
- Docs
- Scala not yet supported