spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported75 Subscribers
View all SubscribersAdd a CodeTriage badge to spark
Help out
- Issues
- [SPARK-55203][PYTHON] Support PathLike in readwriter paths
- [SPARK-56705][PYSPARK][SS] Introduce a JVM-bridged MemoryStream wrapper for PySpark tests (Part 1 of 4)
- [SPARK-55872][UI] Highlight long-running SQL queries by duration
- [SPARK-XXXXX][SQL] Public Column.toJson / Column.fromJson on the V2 catalog interface
- [SPARK-56698][PYTHON] Add Spark MCP (Model Context Protocol) server
- [SPARK-56690][SQL] - Expose common `TaskMemoryManager` API on `HashedRelation` to avoid code duplication
- fix: remove bare excepts and clean up lint issues (quantum-local-fixer)
- Feature request: infer field names in json_tuple
- [SPARK-56227][CORE] Fix GcmTransportCipher to correctly handle multiple messages per channel
- [SPARK] Memory pressure during block fetching of cached tables causes excessive memory usage
- Docs
- Scala not yet supported