DataflowJavaSDK

Empty files in GCS/BigQuery when running dataprep template

Open noamackerman opened this issue 7 years ago • 0 comments

Hi, I am using Dataprep templates to launch Dataflow jobs from a Cloud Function with a GCS trigger (fired when a new file arrives). I consistently see strange behavior: when multiple jobs run concurrently, the results are 0-byte files in GCS or empty tables in BigQuery (I tried both outputs). To reproduce it, use a Dataprep template and launch multiple concurrent jobs with it (e.g. move multiple files into a GCS bucket that triggers a Cloud Function, which uses the SDK to create and run the jobs).

I would highly appreciate your help, Noam

noamackerman, Mar 12 '18 08:03