spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported76 Subscribers
View all SubscribersAdd a CodeTriage badge to spark
Help out
- Issues
- [SPARK-55175][PYTHON] Extract `to_pandas` transformer from serializers
- [SPARK-55185][SQL]. Fix for idempotency being broken, if InferFiltersFromConstraint rule is run as part of post optimization batch with fixed iterations
- [SPARK-46165][PS] Add support for DataFrame.all axis=None
- [WIP][SPARK-55221][PYTHON] Add `to_arrow` transformer and remove `_create_struct_array`
- [SPARK-54179][SQL][FOLLOW-UP] Add Dataframe API support for Tuple sketches
- [SPARK-55086][PYTHON] Add DataSourceReader.pushFilters to Python Data Source API docs
- [SPARK-53745][PYTHON] Update mlflow to 3.1.0
- [SPARK-54342][BUILD] Unify scalatest format reports between SBT and maven builds
- [SPARK-44988][SQL] Support reading Parquet TIMESTAMP(NANOS,false)
- [SPARK-54554][SQL] Enable Dynamic Partition Pruning with CommandResult
- Docs
- Scala not yet supported