spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported
76 Subscribers
Help out
- Issues
- Feature request: infer field names in json_tuple
- [SPARK-55810][UI] Fix missing spacing between table and pagination controls in Jobs and Stages page
- [SPARK-56687][SQL] Support netChanges for DSv2 CDC streaming reads
- [SPARK-56677][SQL] Propagate filter conditions through `Join` nodes in `PlanMerger`
- [SPARK-56686][SQL] Support streaming row-level CDC post-processing
- [SPARK-56663][SQL] Restore fast path for date_trunc MINUTE/HOUR/DAY
- [WIP][SPARK-56661] Introducing logical and physical planning nodes for language-agnostic Spark UDFs
- [SPARK-56648][PYTHON] Refactor SQL_SCALAR_PANDAS_UDF
- [INFRA] Document test base class hierarchy in AGENTS.md
- [SPARK-56674][SS] Add streaming shuffle wire protocol
- Docs
- Scala not yet supported