beam
https://github.com/apache/beam
Java
Apache Beam is a unified programming model for Batch and Streaming data processing.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Java not yet supported
55 Subscribers
Help out
- Issues
- Kinesis x-lang support depends on deprecated Kinesis IO (Aws Sdk v1)
- [Bug]: The top result for "Beam godocs" in Google points to an old Godocs page of Beam
- [Bug]: WriteToFiles in python leave few records in temp directory when writing to large number (100+) of files
- Beam worker closing gRPC connection with many workers and large shuffle sizes
- [Bug]: When using beam.io.kinesis.ReadDataFromKinesis, a java error is raised in DataFlowRunner
- [Bug]: Google Colab DataFrame example crashes with dependency conflict
- [Bug]: Python Lots of fn runner test items cost exactly 5 seconds to run
- [Bug]: WriteToBigquery Deadletter pattern does not work with FILE_LOADS method
- [Feature Request]: Expose TimerStateInternals.currentOutputWatermarkTime to allow for DoFns to handle elements behind the watermark differently
- [Bug]: File Descriptor Leak when upgrading from 2.35.0 to 2.36.0
- Docs
- Java not yet supported