spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported76 Subscribers
View all SubscribersAdd a CodeTriage badge to spark
Help out
- Issues
- [SPARK-55731][Streaming] Assign error class for multiple event time columns
- Is Spark limited to split the Parquet read granularity by Row Group level only?
- [SPARK-56760][PYTHON] Remove dead numpy version check in pandas typehints
- [SPARK-56791][SQL] Add bulk read+widen path for INT32 to Long Parquet vector updater
- Test Netty 4.2.13
- [SPARK-56765][INFRA] Fix mypy attr-defined errors with PyArrow 24+
- [SPARK-56763][INFRA] Fix docker image build failures on branch-3.5
- [SPARK-56760][PYTHON] Remove dead numpy 1.21 version check in pandas typehints
- [SPARK-34591][ML] Add decision tree pruning as a parameter
- [SPARK-56745][SQL] Cache foldable ZoneId in ConvertTimezone to avoid per-row lookup
- Docs
- Scala not yet supported