beam
https://github.com/apache/beam
Java
Apache Beam is a unified programming model for Batch and Streaming data processing.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Java not yet supported
55 Subscribers
Help out
- Issues
- Remove mypy ignore line from apache_beam/ml/inference/base.py once Dataclass is replaced with NamedTuple
- [Task]: grouping on categorical columns should not require Singleton partitioning
- [Bug]: var does not support per-level aggregation (level= kwarg)
- [Feature Request]: [Go SDK] Support OnWindowExpiration
- StructuredCoder should have abstract getComponents function.
- [Task]: Add a first-class pipeline option for `--experiments=enable_stackdriver_agent_metrics`
- [Task]: Review remaining jira references in the codebase
- Support TestStream on the fn_api_runner
- Make FnApiRunner work by executing ready elements instead of stages
- Delete python DirectRunner after FnApiRunner fully supports Batch & Streaming
- Docs
- Java not yet supported