beam
beam copied to clipboard
Apache Beam is a unified programming model for Batch and Streaming data processing.
### What needs to happen? Isolate usage of Scala APIs in ScalaInterop utility where unreadable in Java ### Issue Priority Priority: 2 ### Issue Component Component: runner-spark
### What would you like to happen? There's been a quiet request to have windows also be providable to CombineFn methods. Very similar to key aware CombineFns in issue #22211...
This PR aims to illustrates an example of performing online clustering using Stateful Processing and RunInference. The entire implementation is divided into two different pipelines: 1. `write_data_to_pubsub_pipeline` pushes data to...
@olehborysevych fixes #22876 ------------------------ Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily: - [x] [**Choose reviewer(s)**](https://beam.apache.org/contribute/#make-your-change) and mention them in a...
In our pipeline, workers are sometimes stuck at step __WriteShardsIntoTempFiles__. We believe this happens when GCS fails to respond for more than 10 minutes and Google's VPC cuts the connection....
### What needs to happen? Create learning content and challenge for Tour of Beam::Introduction module consisting of following unit: - CoGroup - Join - Filter - Select - New motivating...
### What needs to happen? Adding the tags field to the example response ### Issue Priority Priority: 1 ### Issue Component Component: beam-playground
Here, I've created a minimum viable Apache Beam runner for Dask. My approach is to visit a Beam Pipeline an translate PCollections into Dask Bags, and PTransformations to Bag methods....
As part of the migration of Precommit and Postcommit Jobs from Jenkins to GA in self-hosted runners, this PR contains: Migrated workflow [job-postcommit-python-examples.yml](https://github.com/fernando-wizeline/beam/blob/gam-postcommit-python-sharding-examples/.github/workflows/job-postcommit-python-examples.yml) The migrated workflows were added to [CI.md](https://github.com/fernando-wizeline/beam/blob/gam-postcommit-python-sharding-examples/CI.md)...
Adds a tensorflow example to the other ml examples for run_inference in the notebook. See https://github.com/apache/beam/issues/23091 ------------------------ Thank you for your contribution! Follow this checklist to help us incorporate your...