beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
Fixes #22543. When using multiple partitions for a BigQuery `FILE_LOADS` write, the user's dynamic destination type isn't properly passed into `UpdateSchemaDestination`. This leads to a `ClassCastException` when attempting to get...
### What needs to happen? The output of this task is a [TypedSchemaTransformProvider](https://github.com/apache/beam/blob/master/sdks/java/core/src/main/java/org/apache/beam/sdk/schemas/transforms/TypedSchemaTransformProvider.java) extension for reading from and writing to text files using the TextIO.Read and TextIO.Write transforms, respectively. Briefly described,...
### What would you like to happen? Currently transforms that return a TimestampedValue need to be typed as plain "TimestampedValue" rather than generic "TimestampedValue[T]" so all underlying information about what...
This will reduce OOMs in the case where we are adding new writes faster than we are able to clean up older ones across elements.
### What happened? I was checking this example ``` sdks/python/apache_beam/examples/snippets/snippets.py ``` And I think there might be an issue with this line of code ``` file_handle.seek(tracker.current_restriction.start()) ``` It should be...
### What happened? While investigating #22115, I found an interesting observation. Pytest results with timing information: ``` ---------------------------------- Original: =========================================================================================== slowest 100 test durations ============================================================================================ 10.01s call apache_beam/runners/portability/fn_api_runner/fn_runner_test.py::FnApiRunnerTestWithDisabledCaching::test_pack_combiners 9.82s...
### What would you like to happen? The current implementation of RunInference provides model handlers for PyTorch and Sklearn models. These handlers assume that the method to call for inference...
### What would you like to happen? Acquiring scientific dependencies in the Python ecosystem is challenging. `pip` and `apt-get` alone are not sufficient, for various reasons, the most significant of...
### What needs to happen? Please consider upgrading the dependency org.apache.samza. The current version is 1.5.0. The latest version is 1.7.0. ### Issue Priority Priority: 2 ### Issue Component Component:...