spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported
76 Subscribers
Help out
- Issues
- [SPARK-55241][SQL]. Fix SQL Streaming idempotency broken, with PropagateEmptyRelation and InferFiltersFromConstraint run as part of Fixed Number of iterations
- [MINOR] Raise exception if no active Spark context
- [SPARK-55271][SS] The error originates in KafkaMicroBatchStream.metrics() at line 520, where the code attempts to call .get() on a Scala Option that contains null, then immediately invokes .map() on the null result
- [WIP] POC for serializer changes
- [WIP][SPARK-53928][SQL] Enhance DSV2 partition filtering using catalyst expression
- [SPARK-53890][SDP] Test (and fix) read/readstream options are respected for pipelines
- [SPARK-48750][SQL] AQEPropagateEmptyRelation convert broadcast query stage plan to empty relation causing error
- [SPARK-49671][SQL] Remove the RTRIM collation config
- [SPARK-54890][PYTHON] Allow users to enforce timezone match for timestamp conversion
- [SPARK-54876][SQL] The splitSemiColon function should correctly split SQL statements
- Docs
- Scala not yet supported