spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported76 Subscribers
View all SubscribersAdd a CodeTriage badge to spark
Help out
- Issues
- [SPARK-55338][CONNECT] Centralize Spark Connect request decompression logic in gRPC interceptor
- [SPARK-54881][SQL][FOLLOWUP] Extract simplifyNot method in BooleanSimplification
- [SPARK-55335][PYTHON][TESTS] Use eventually instead of hard-coded wait for datasource test
- [SPARK-55336][PYTHON] Let createDF use create_batch logic for decoupling
- [SPARK-55326][PYTHON][Connect] Release remote session when SPARK_CONNECT_RELEASE_SESSION_ON_EXIT is set
- [SPARK-55337][SS] Fix MemoryStream backward compatibility
- [SPARK-55328][SQL][PYTHON] Reuse PythonArrowInput.codec in GroupedPythonArrowInput
- [SPARK-55327][K8S] Reduce Spark docker image sizes
- [SPARK-54969][PYTHON] Implement new arrow->pandas conversion
- [SPARK-55323][PYTHON] Move UDF metadata to EvalConf to simplify worker protocol
- Docs
- Scala not yet supported