spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported76 Subscribers
View all SubscribersAdd a CodeTriage badge to spark
Help out
- Issues
- [SPARK-54418][SQL][PYSPARK] Support path-like targets in DataFrame.mergeInto
- Add `Information for new contributors` in Issues
- [WIP][SPARK-55657][BUILD] Bump Hadoop 3.5.0 RC0
- [SPARK-55658][PYTHON] SparkSessionBuilder.create in PySpark classic should mirror getOrCreate path as much as possible
- [SPARK-55626][SQL][4.1] Don't load metadata columns on Table unless needed in V2TableUtil
- Suggestion: reference WFGY Problem Map (RAG / LLM debugging checklist) for Spark + LLM workloads
- [PYTHON] Support path-based table reference in `DataFrame.mergeInto`
- Catalyst optimizer non-convergence with iterative withColumn rewrite + filter pushdown in Spark
- [TEST] `test_session.py` does not work properly
- [SPARK-55610][PYTHON] Add getExecutorInfos to StatusTracker in Python
- Docs
- Scala not yet supported