spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported75 Subscribers
View all SubscribersAdd a CodeTriage badge to spark
Help out
- Issues
- [SPARK-56375][PANDAS/PS] Implement DataFrame.set_axis and Series.set_axis
- [SPARK-55818][PS] Decimal-float mixed arithmetic should always raise TypeError
- [SPARK-57275][CONNECT] Validate row count after consuming all arrow batches
- Fix name for frequent items/heavy hitters sketch from highly misleading "approx top k"
- [WIP][SPARK-55206][PYTHON][SQL] Transpilation minimal functional implementation with python
- [SPARK-57222][SDP] Implement SCD2 Batch Processor; Decompose affected rows
- [SPARK-57225][SS] Reject write operations on state data sources with clear error
- [SPARK-56632][CONNECT][TESTS][4.1] Add E2E test for self-join reusing a DataFrame
- [SPARK-56632][CONNECT][TESTS][4.2] Add E2E test for self-join reusing a DataFrame
- [SPARK-57220][SQL] Extend block-chunked segment-tree window frame to shrinking frames
- Docs
- Scala not yet supported