Apache Beam is a unified programming model for Batch and Streaming data processing.

Results 933 beam issues

Currently, when using a file-based source implementation to read data from files, we have two output options: - read only the content of each line of each file...

java
io
extensions
xml
Next Action: Reviewers

Change classes that explicitly inject a `MetricTrackingWindmillServerStub` to take in a Function. This will give flexibility in later refactoring as we apply different ways to fetch the data without MetricTrackingWindmillServerStub....

runners
dataflow
Next Action: Reviewers
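
The entry above describes passing a `Function` to classes instead of a concrete stub. A minimal sketch of that general idea follows, using only `java.util.function.Function`. The class name `WorkItemReader`, the field `fetchState`, and the `String` key and value types are hypothetical and are not taken from the Beam codebase.

```java
import java.util.function.Function;

// Hypothetical illustration: the class depends on a Function rather than
// on a concrete server stub, so the fetching strategy can be swapped later.
final class WorkItemReader {
    private final Function<String, String> fetchState;

    WorkItemReader(Function<String, String> fetchState) {
        this.fetchState = fetchState;
    }

    // Delegates to whatever fetching strategy was injected.
    String read(String key) {
        return fetchState.apply(key);
    }

    public static void main(String[] args) {
        // In production code the function might wrap a real stub;
        // in a test it can be a plain lambda.
        WorkItemReader reader = new WorkItemReader(key -> "state-for-" + key);
        System.out.println(reader.read("k1"));
    }
}
```

Because the dependency is a plain function, a test can supply a lambda and a later refactoring can supply a different fetching path without changing the class's constructor signature.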

Removed flaky logic around waiting in tests. Removed `Thread.sleep` and replaced it with triggers via `CountDownLatch`. R: @scwhittle @Abacn ------------------------ Thank you for your contribution! Follow this checklist to help us...

runners
dataflow
Next Action: Reviewers
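
The entry above mentions replacing sleeps with `CountDownLatch` triggers in tests. The sketch below shows the general pattern with standard `java.util.concurrent` classes only. The class `LatchExample`, the method `runAndAwait`, and the 5-second timeout are illustrative assumptions and do not come from the pull request.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical illustration: wait for background work with a latch
// instead of sleeping for a guessed duration.
final class LatchExample {
    static int runAndAwait() {
        AtomicInteger result = new AtomicInteger();
        CountDownLatch done = new CountDownLatch(1);
        ExecutorService executor = Executors.newSingleThreadExecutor();
        try {
            executor.submit(() -> {
                result.set(42);      // the background work
                done.countDown();    // signal completion explicitly
            });
            // Block until signalled; the timeout only guards against hangs.
            if (!done.await(5, TimeUnit.SECONDS)) {
                throw new IllegalStateException("worker did not finish in time");
            }
            return result.get();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while waiting", e);
        } finally {
            executor.shutdownNow();
        }
    }

    public static void main(String[] args) {
        System.out.println(runAndAwait());
    }
}
```

Unlike a fixed sleep, the waiting thread resumes as soon as the worker signals, so the test is neither slower than needed nor dependent on timing guesses.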

Support Flink 1.17. Closes #29939.

build
runners
flink

Saves the submission environment dependencies and stages them. Logs them along with the runtime dependencies. Fixes #28563

python
docker
Next Action: Reviewers

Handling DataStream, windowing, and more complex types will come in a future PR.

runners
flink

This PR closes #28930 with a PTransform implementation that throttles a PCollection without using any external resources (e.g. an external database or queue). Please see #28930 for further details on...

java
io
Next Action: Reviewers

No need to submit or review; this will be thrown away once testing is complete. There will be smaller PRs with the changes.

runners
dataflow

This pull request introduces stress tests for BigQueryIO, designed to assess the performance under various conditions. The stress tests simulate dynamic load increases and evaluate the behavior of BigQueryIO for...

Next Action: Reviewers

### What happened? Sometimes, a global window side input takes too long to update on a Dataflow job. The automatic model refresh feature of RunInference uses a pattern `WatchFilePattern` which...

python
dataflow
bug
P2