dataproc-templates

Dataproc templates and pipelines for solving simple in-cloud data tasks

Results 104 dataproc-templates issues

The input files referenced by test properties such as the following should be checked in under `java/.ci/`:

- gcs.spanner.input.location=gs://dataproc-templates/data/avro/empavro
- gcs.gcs.input.location=gs://dataproc-templates/integration-testing/gcstogcs/csvtoavro/csv/cities.csv
- gcs.bigtable.input.location=gs://dataproc-templates/data/csv/GCSToBigTable_cities.csv
- gcs.bigquery.input.location=gs://dataproc-templates/integration-testing/gcstobigquery/cities.csv
- text.bigquery.input.location=gs://dataproc-templates/data/deflate

Then, in `stage('Reset Resources')` of the JenkinsFile, they should be overwritten before running the tests.

java
automation

Publish a blog post on running the [WordCount example](https://github.com/GoogleCloudPlatform/dataproc-templates/blob/main/java/src/main/java/com/google/cloud/dataproc/templates/word/WordCount.java) using Dataproc Serverless (Java).

publishing
java

Currently we use the `export JARS` option to provide the Spark BigQuery connector JAR file. The connector is available from the Maven Central [repository](https://mvnrepository.com/artifact/com.google.cloud.spark/spark-bigquery-connector-common). Update `pom.xml` so that the connector is pulled from there instead.
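
As a rough sketch of what such a `pom.xml` change could look like, the fragment below declares the connector as a Maven dependency. The group and artifact IDs are taken from the Maven Central link above; the `spark.bigquery.connector.version` property name is hypothetical, and the version is left as a placeholder to be filled in. Whether this artifact or a different connector packaging is the right one for the templates is an assumption that would need checking.

```xml
<!-- Sketch only: goes inside the existing <dependencies> section of pom.xml. -->
<dependency>
  <groupId>com.google.cloud.spark</groupId>
  <artifactId>spark-bigquery-connector-common</artifactId>
  <!-- Hypothetical property; define it under <properties> with the chosen version. -->
  <version>${spark.bigquery.connector.version}</version>
</dependency>
```

With a dependency like this in place, the connector would be resolved at build time, which is what would allow the `export JARS` step to be dropped.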

enhancement
java

According to its [documentation](https://github.com/GoogleCloudDataproc/spark-bigquery-connector?tab=readme-ov-file#writing-data-to-bigquery), the Spark BigQuery connector supports append and overwrite modes. We need to thoroughly test and validate the various scenarios and update the README.md file for writing to BigQuery...

documentation
python
java