spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported
75 Subscribers
Help out
- Issues
- [SPARK-57341][INFRA] Reconcile JIRA components with the PR title in merge script
- [SPARK-57389][PYTHON][TESTS] Use dedicated Pandas 3 golden files for pandas UDF coercion tests
- [SPARK-57296][SPARK-57346][SQL] Fix incorrect aggregate resolution in ORDER BY with GROUPING ANALYTICS
- fix: upgrade com.squareup.okhttp3:okhttp to 4.9.2 (CVE-2021-0341)
- [SPARK-57223][SQL] INCOMPATIBLE_DATA_FOR_TABLE errors mis-quote a column/field name that contains a dot
- Update docs/control-flow/leave-stmt.md
- [SPARK-57269][SS] Enforce read-only access in the StateDataSource/StateMetadataSource
- refactor: replace misleading approx_top_k terminology with frequent items naming
- [SPARK-57268][SQL] Add Apache Arrow as a native cache format for in-memory Dataset caching
- [SPARK-56375][PANDAS/PS] Implement DataFrame.set_axis and Series.set_axis
- Docs
- Scala not yet supported