beam
beam copied to clipboard
Apache Beam is a unified programming model for Batch and Streaming data processing.
This is an example of a regression model using an sklearn pipeline. This illustrates how to separate examples based on what features are available in the example and then using...
- show github button on resolutions between 1024 and 640 - change width for svg on resolutions between 1024 and 640 ____________________________________________________________________________________________________________________________ before:  after:  ------------------------ Thank you for...
Currently close happens in processElement which is per-window. If there are many windows firing this can throttle throughput waiting for IO instead of closing in parallel in finishBundle. Imported from...
### What happened? Seeing this [error](https://ci-beam.apache.org/job/beam_PostCommit_Python38/2960/testReport/apache_beam.runners.dataflow.dataflow_exercise_metrics_pipeline_test/ExerciseMetricsPipelineTest/test_metrics_it/) in `test_metrics_it`. There were no changes at the time of first failure (run 2960). ``` self = @pytest.mark.it_postcommit def test_metrics_it(self): result = self.run_pipeline() errors...
### What happened? Seeing this error in Python [PostCommits](https://ci-beam.apache.org/job/beam_PostCommit_Python37_PR/411/console) ``` 21:36:07 Exception in thread read_grpc_client_inputs: 21:36:07 Traceback (most recent call last): 21:36:07 File "/usr/lib/python3.7/threading.py", line 926, in _bootstrap_inner 21:36:07 self.run()...
### What happened? Experiencing a weird BigQuery error without any traces of what exactly is a problem. This only occurs when we start draining the job, I don't think that...
### What needs to happen? - [ ] #22339 - [ ] #22340 - [ ] #22341 ### Issue Priority Priority: 3 ### Issue Component Component: beam-playground
**Please** add a meaningful description for your change here ------------------------ addresses #22571 Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily: -...
Run VR tests for Spark streaming runner rather than custom tests (test are already run as part of the "normal" unit test run). If `forceStreaming` is set to `true`, the...
### What would you like to happen? RIght now, the beam python sdk uses `cibuildwheel==1.11.0` that use get-pip.py to install the PyPI dependencies. There has been some issues related to...