beam
beam copied to clipboard
Apache Beam is a unified programming model for Batch and Streaming data processing.
Imported from Jira [BEAM-13049](https://issues.apache.org/jira/browse/BEAM-13049). Original Jira may contain additional context. Reported by: aromanenko. Subtask of issue #21253
Fixing the broken link to the Retry Policy blogpost ------------------------ Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily: - [x] [**Choose...
Following test codes cause `java.lang.IllegalArgumentException: Except when using GlobalWindows, calling .triggering() to specify a trigger requires that the allowed lateness be specified using .withAllowedLateness() to set the upper bound on...
This PR address #21412 with a PubsubSchemaTransformWriteConfiguration implementation. It's design goals are to like-for-like replicate PubsubSchemaIOTransform write configuration details. Subsequent to this PR's approval/merge, the plan is to implement the...
R: @damccorm @pabloem **Please** add a meaningful description for your change here ------------------------ Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily:...
GitHub Actions Tests Status (on master branch) ------------------------------------------------------------------------------------------------ [](https://github.com/apache/beam/actions?query=workflow%3A%22Build+python+source+distribution+and+wheels%22+branch%3Amaster+event%3Aschedule) [](https://github.com/apache/beam/actions?query=workflow%3A%22Python+Tests%22+branch%3Amaster+event%3Aschedule) [](https://github.com/apache/beam/actions?query=workflow%3A%22Java+Tests%22+branch%3Amaster+event%3Aschedule) [](https://github.com/apache/beam/actions?query=workflow%3A%22Go+tests%22+branch%3Amaster+event%3Aschedule) See [CI.md](https://github.com/apache/beam/blob/master/CI.md) for more information about GitHub Actions CI.
This got rid of about half of the overhead of activating the metrics container for the current thread. The updated implementation is about 48 ms for the two calls to...
KafkaIO should raise an error if both .withReadCommitted() and .commitOffsetsInFinalize() are used
Read committed tells KafkaIO to only read messages that are already committed which means that committing offsets in finalize is a no-op. Users should be using one or the other...
This change: * swaps java imports * fixes a new deprecation issue TextFormat.printToString -> TextFormat.printer().printToString caused by the update * deletes the old 1.43.2 package ------------------------ Thank you for your...
### What would you like to happen? I'm currently using a streaming Apache Beam pipeline on a Dataflow Runner with an attached GPU to perform real-time inference. We ingest Pub/Sub...