spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported
76 Subscribers
Help out
- Issues
- [SPARK-38719][SQL] Test the error class: CANNOT_CAST_DATATYPE
- [SPARK-49642][SQL] Remove the ANSI config suggestion in DATETIME_FIEL…
- [SPARK-55902][PYTHON] Refactor SQL_ARROW_BATCHED_UDF to use ArrowStreamSerializer
- [SPARK-53675][PYTHON] Add str support in withColumn and withColumns in PySpark
- Fix potential NPE/IllegalArgumentException by wrapping toBoolean call with Try and defaulting to false for LEGACY_PARQUET_NANOS_AS_LONG config
- [SPARK-54878][SQL] Add sortKeys option to to_json function
- [SPARK-56001][SQL] Add INSERT INTO ... REPLACE ON/USING syntax
- dropDuplicates(columns) followed by ExceptAll results in INTERNAL_ERROR_ATTRIBUTE_NOT_FOUND
- [SPARK-55930][SPARK-55931][CONNECT] Byte-aware gRPC metadata size checks and errorClassFallback
- [SPARK-55897][SQL] Handle UserDefinedType in ColumnarRow, ColumnarBatchRow, and ColumnarArray get()
- Docs
- Scala not yet supported