spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported
76 Subscribers
Help out
- Issues
  - [SPARK-56355][SQL] Improve join stats estimation when equi-join keys lack column statistics
  - [SPARK-55242][PYTHON] Handle np.ndarray elements in list-valued columns when converting from pandas
  - Webpage for downloading binaries is broken
  - [SPARK-40328][PS] Implement DataFrame.compare
  - [SPARK-56320] Introduce JIT compilation metrics
  - [SPARK-48275][SQL][DOCS] Improve array_sort default comparator documentation
  - [SPARK-56325][SDP] Refactor FlowSystemMetadata.flowCheckpointsDirOpt to avoid scala.Option
  - [SPARK-56304][SQL] Support IF NOT EXISTS for V2 file table INSERT OVERWRITE
  - [SPARK-54055][CONNECT][PYTHON] Clean up per-session PythonWorkerFactory
  - [SPARK-56158][CORE] Support limitActiveProcessorCount in local mode
- Docs
  - Scala not yet supported