DataflowJavaSDK icon indicating copy to clipboard operation
DataflowJavaSDK copied to clipboard

Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.

Results 52 DataflowJavaSDK issues
Sort by recently updated
recently updated
newest added
trafficstars

I took the title from a previous issue #105 that has been resolved and it sounds just like my issue. I have a pipeline that reads from unbonded pubsubio and...

Been recently developing a Dataflow consumer which read from a PubSub subscription and outputs to Parquet files the combination of all those objects grouped within the same window. While I...

This is going to be quite a hit or miss question as I don't really know which context or piece of code to give you as it is a situation...

Could you please revert the deletion of source code? This code is still in production and the repo should exist and be public. Once that's no longer true it should...

To reproduce: - Upload a simple file (10000 sequential numbers, one per line) to Google storage specifying GZIP compression `gsutil cp -Z numbers.txt gs:///numbers.txt`. - Execute a simple dataflow just...

tracking

Following is the code that writes to BigQuery: ``` BigQueryIO.writeTableRows() .to(destination) .withCreateDisposition(CREATE_IF_NEEDED) .withWriteDisposition(WRITE_APPEND) .withSchema(tableSchema) .expand(tableRows); ``` Here's the destination's implementation: ``` public TableDestination apply(ValueInSingleWindow input) { String partition = timestampExtractor.apply(input.getValue())...

Dataflow SDK version: 2.4.0 I haven't tested with other attribute but when we use these attribute for de-duplication and timestaming we got null PubsubMessage's attribute For example ``` PubsubIO.readMessages() .fromSubscription("mine")...

The new Google API Client Library, version 1.23.0, appears to cause problems with the Dataflow Java SDK when submitting and/or running jobs. This appears to affect Dataflow Java SDKs in...

trying to run BigQueryTornadoes and keep having this error : [ERROR] Failed to execute goal org.codehaus.mojo:exec-maven-plugin:1.4.0:java (default-cli) on project google-cloud-dataflow-java-examples-all: An exception occured while executing the Java clas s. null:...

Dataflow is not aboutable to find files packaged with my classes. I use Class.getResource("/data.json"). Stackdriver log shows it's looking for the file in /var/opt/google/dataflow/some-random-jar-name.jar!/data.json. When I ssh into the VM...