DataflowJavaSDK
DataflowJavaSDK copied to clipboard
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
There are some cases where the SDK will allow step names to be defined with the Slash '/' character. These should be handled in the SDK so that steps are...
A common request is the ability in streaming pipelines to publish data to a file per window, like already exists for [`BigQueryIO`](https://github.com/GoogleCloudPlatform/DataflowJavaSDK/blob/2b200759edc25a6ec43ca277dac33fb62fddf475/sdk/src/main/java/com/google/cloud/dataflow/sdk/io/BigQueryIO.java#L152). We should add a `ParDo`-based example for this,...