
Apache Beam is a unified programming model for Batch and Streaming data processing.

Results 933 beam issues

Adding Python API for Managed transforms, similar to the [Java API](https://github.com/ahmedabu98/beam/blob/bb4ad994304d0bb5f31196f3bc2e4e3616ae59bb/sdks/java/managed/src/main/java/org/apache/beam/sdk/managed/Managed.java#L76).

python
java
model
io
kafka

There are folks out there trying to run Beam Python on Databricks [1]. While there is documentation out there for the Java SDK [2], Python is more involved because the...

runners
spark
portability
P3
wish

PubsubMessageWithTopicCoder should return PubsubMessageWithTopicCoder, not PubsubMessageWithAttributesAndMessageIdCoder. While investigating Dynamic Destinations on the Direct runner, I found that PubsubMessageWithTopicCoder is never used, so the topic is lost and the pipeline fails. Fixes #31679

java
io
gcp
Next Action: Reviewers

### What happened? Referencing [JdbcIO's doc](https://beam.apache.org/releases/javadoc/current/org/apache/beam/sdk/io/jdbc/JdbcIO.html), the section on parallel reading from a JDBC datasource says to use either of these column types for partitioning `Beam supports...

java
flink
P3
bug
awaiting triage

Bumps [github.com/aws/aws-sdk-go-v2/feature/s3/manager](https://github.com/aws/aws-sdk-go-v2) from 1.13.8 to 1.17.2. Changelog Sourced from github.com/aws/aws-sdk-go-v2/feature/s3/manager's changelog. Release (2022-12-02) General Highlights Dependency Update: Updated to the latest SDK module versions Module Highlights github.com/aws/aws-sdk-go-v2/service/appsync: v1.17.0 Feature: Fixes...

go
dependencies

WIP: Added insertion and enrichment pipeline for RAG (Retrieval Augmented Generation) use case #GSOC-259 ------------------------ Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and...

examples

This allows one to define new transforms with YAML that can be imported and used in other pipelines.

python
yaml

This sets heartbeat work by default and will roll out with the next Beam release. This CL also refactors and fixes some unit test issues.

runners
dataflow
Next Action: Reviewers

Fixes #31354

java
io
gcp

### What would you like to happen? I want to create a new UDF which supports a lambda parameter and a Row parameter, for example ```java static class LambdaUDFA implements...

java
new feature
P2
awaiting triage