beam
https://github.com/apache/beam
Java
Apache Beam is a unified programming model for Batch and Streaming data processing.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Java not yet supported55 Subscribers
View all SubscribersAdd a CodeTriage badge to beam
Help out
- Issues
- Convert all type comments to type annotations with com2ann
- Add support for inferring Beam schemas from dataclasses
- Parquetio should produce a schema'd PCollection
- Fix 'RuntimeValueProvider' object has no attribute 'projectId' error in _CustomBigQuerySource.split
- Make InteractiveRunner work without runner api roundtrip
- Support using FlinkRunner with the beam_sql magic
- Supports JSON in SnowflakeIO
- Incomplete argument validation in SnowflakeIO
- DirectRunner does not update reference to currentRestriction when running in SDF
- DropFields PTransform automatically unnesting remaining fields
- Docs
- Java not yet supported