beam
https://github.com/apache/beam
Java
Apache Beam is a unified programming model for Batch and Streaming data processing.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Java not yet supported55 Subscribers
View all SubscribersAdd a CodeTriage badge to beam
Help out
- Issues
- Overlapping sessions with zero allowed lateness due to window expiry rules
- Update DirectRunner's SDF implementation to support side outputs
- BigQuery Partitioned table creation/write fails when destination has partition decorator
- Add support for overriding PTransforms with multiple outputs
- DataflowRunner for python streaming uses portable containers
- Have the pipeline supply beam-sdks-java-harness instead of embedding it within the beam-sdks-java-container
- Prohibit stacked GBKs with accumulating mode
- DoFns should be torn down as part of an orderly shutdown
- Create a generator of finite-but-unbounded PCollection's for integration testing
- Port ElasticSearchIOTest off DoFnTester
- Docs
- Java not yet supported