spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported
75 Subscribers
Help out
- Issues
  - Is Spark limited to split the Parquet read granularity by Row Group level only?
  - [SPARK-28587][SQL] Route JDBC partition bound literals through JdbcDialect.compileValue
  - [SPARK-56726][CONNECT] Add Dataset.getNumPartitions to Spark Connect client
  - [SPARK-56734][CORE] Optimize RocksDBPersistenceEngine with Column Families and zero-allocation prefix matching
  - fix: the hive thrift server's ldap authentication pr... in...
  - Predicate Pushdown in Spark Structured Streaming (DataSource V2).
  - Support Filter pushdown in Spark Structured Streaming
  - [SPARK-56413][SQL][UDF] gRPC implementation of the UDF worker protocol dispatcher
  - Error: Illegal Parquet type: FIXED_LEN_BYTE_ARRAY (UUID)
  - [SPARK-54419][SQL] Avoid expanding expensive alias chains in optimizer
- Docs
  - Scala not yet supported