spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported75 Subscribers
View all SubscribersAdd a CodeTriage badge to spark
Help out
- Issues
- [SPARK-56468][UDF] Validate required worker capabilities in direct dispatcher
- [SPARK-56891][CONNECT] Propagate Spark Connect session user as a SparkContext local property
- User Defined Functions crash Spark Dataframes created directly, but not for ones made from Pandas on Spark.
- Feature request: inferSchema do not infer digit strings that start with 0 as integer
- [SPARK-55792][PS] Optimize DataFrame diff axis=0
- [SPARK-50593][SQL] SPJ: Support truncate transform via generalized ReducibleFunction API
- [SPARK-56729][SQL] ReplaceData and WriteDelta should implement SupportsNonDeterministicExpression
- [SPARK-XXXXX][CORE] Add IO_URING transport mode
- [SPARK-56845] [K8S] Truncate ConfigMap names that exceed DNS subdomain limit
- [SPARK-56826][SQL] Skip PushVariantIntoScan for VariantGet with null-evaluating path
- Docs
- Scala not yet supported