Google Cloud Dataproc

Results 8 repositories owned by Google Cloud Dataproc

hadoop-connectors

276
Stars
231
Forks
Watchers

Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.

spark-bigquery-connector

352
Stars
190
Forks
Watchers

BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.

initialization-actions

584
Stars
515
Forks
Watchers

Run in all nodes of your cluster before the cluster starts - lets you customize your cluster

cloud-dataproc

192
Stars
122
Forks
Watchers

Cloud Dataproc: Samples and Utils

bdutil

109
Stars
98
Forks
Watchers

[DEPRECATED] Script used to manage Hadoop and Spark instances on Google Compute Engine

custom-images

31
Stars
59
Forks
Watchers

Tools for creating Dataproc custom images

hive-bigquery-storage-handler

19
Stars
11
Forks
Watchers

Hive Storage Handler for interoperability between BigQuery and Apache Hive

spark-spanner-connector

16
Stars
12
Forks
Watchers

Cloud Spanner Connector for Apache Spark