spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported76 Subscribers
View all SubscribersAdd a CodeTriage badge to spark
Help out
- Issues
- [SPARK-56538][CONNECT] Add per-RPC deadlines to Spark Connect client
- [SPARK-56429][DOCS] Clarify differences between nullValue and emptyValue CSV options
- [SPARK-56488][BUILD][3.5] Bump Scala 2.13 to 2.13.9 on branch-3.5
- [SPARK-56312][PYTHON] Refactor SQL_COGROUPED_MAP_ARROW_UDF
- [SPARK-56519][PYTHON] Isolate communication part from python udf worker
- Fetching of blocks of cache table may cause high memory pressure
- [SPARK-38101][CORE] Retry INTERNAL_ERROR_BROADCAST when fetching map statuses
- [SPARK-56568][SQL] Add id() to DSv2 Column to detect drop-and-re-add columns at refresh time
- [WIP][SPARK-56514][SQL] Allow catalog Table objects to be passed directly to read.table() and writeTo()
- [SPARK-56505][SQL][TESTS] SharedSparkSession provides sql.SparkSession, add SharedClassicSparkSession, ClassicSQLTestUtils
- Docs
- Scala not yet supported