beam icon indicating copy to clipboard operation
beam copied to clipboard

Apache Beam is a unified programming model for Batch and Streaming data processing.

Results 933 beam issues
Sort by recently updated
recently updated
newest added

After 2.40.0 (this PR https://github.com/apache/beam/pull/17423), there are three to get the failed inserts from a `BigQueryIO` Write, as it was now added a specific tag/PCollection when using the write method...

java
io
gcp

Beam 2.38.0 I encounter the following stacktrace when I try to drain a Dataflow pipeline. During the normal execution the pipeline is flawless, but gets stuck during draining, I don't...

runners
dataflow
bug
P2
awaiting triage

**Please** add a meaningful description for your change here ------------------------ Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily: - [ ]...

runners
samza

### What needs to happen? Samza Runner does not support synchronized process time timer now. The ticket is to add the support in Samza Runner. ### Issue Priority Priority: 3...

runners
samza
P3
task
awaiting triage

Fixes #19357 Was considering add another suppression in #22479 and found these were all outdated **Please** add a meaningful description for your change here ------------------------ Thank you for your contribution!...

java
build

- PR that implements a notebook to demonstrate the usage of the beam dataframe API as a preprocessing tool for ML training WIP: - [ ] **Find a method to...

examples

Unfortunately I haven't been able to diagnose the exact issue here or come up with a minimal repro. I just have some code to reproduce in https://github.com/apache/beam/pull/16445. That PR adds...

stale
python
core
bug
P2

### What would you like to happen? We would like to use the DataFrame API to perform one-hot encoding on categorical columns. Currently, this can be done with `pd.get_dummies()` method...

new feature
P2
dsl
dataframe

Part of #20810 This PR adds a wrapper function 'read_gbq', that wraps a wrapper PTransform 'ReadGbq'. 'ReadGbq' allows users to Read from BQ, convert the schema into a Beam Schema,...

python
io
gcp
Next Action: Reviewers

TODO: - [x] Get tests passing - [x] Clarify separation of concerns between pandas_type_compatibility and dataframe.schemas - [x] Address TODOs (mostly error string comments) - [x] Remove duplicated logic in...

python
go
model