spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported75 Subscribers
View all SubscribersAdd a CodeTriage badge to spark
Help out
- Issues
- [SPARK-57328][SQL] Extract coalesce hint-building helpers into CoalesceHintUtils
- [WIP][SPARK-57421][SQL][CONNECT] Support @-syntax version and timestamp time travel on table names
- [SPARK-57420][INFRA] Add generate-tpcds input and early CPU check to benchmark workflow
- [SPARK-57356][SDP] Implement SCD2 Batch Processor; Cleanup Delete Encoding Rows Post-Reconciliation
- [SPARK-57378][SDP] Implement SCD2 Batch Processor; Merge Reconciled Rows into Aux and Target Tables
- [SPARK-57394][PYTHON] Refactor SQL_ARROW_TABLE_UDF
- [SPARK-56975][SS] Warn when DataStreamReader.table() is given a user-specified schema
- [INFRA] Add backport confirmation and show push target branch in merge_spark_pr.py
- [SPARK-56596][SQL] Enable dual runs for single-pass analyzer
- [SPARK-57322][SDP] Implement SCD2 Batch Processor; Reconcile StartAt/EndAt
- Docs
- Scala not yet supported