Google Cloud Dataproc
Results
8
repositories owned by
Google Cloud Dataproc
hadoop-connectors
276
Stars
231
Forks
Watchers
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
spark-bigquery-connector
352
Stars
190
Forks
Watchers
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
initialization-actions
584
Stars
515
Forks
Watchers
Run in all nodes of your cluster before the cluster starts - lets you customize your cluster
bdutil
109
Stars
98
Forks
Watchers
[DEPRECATED] Script used to manage Hadoop and Spark instances on Google Compute Engine
custom-images
31
Stars
59
Forks
Watchers
Tools for creating Dataproc custom images
hive-bigquery-storage-handler
19
Stars
11
Forks
Watchers
Hive Storage Handler for interoperability between BigQuery and Apache Hive