emr-cluster topic

List emr-cluster repositories

spark-boilerplate

10
Stars
3
Forks
Watchers

A boilerplate for spark projects with docker support for local development and scripts for emr support.

Repo-2019

136
Stars
74
Forks
Watchers

BERT, AWS RDS, AWS Forecast, EMR Spark Cluster, Hive, Serverless, Google Assistant + Raspberry Pi, Infrared, Google Cloud Platform Natural Language, Anomaly detection, Tensorflow, Mathematics

goodreads_etl_pipeline

1.2k
Stars
209
Forks
Watchers

An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.

terraform-aws-emr-cluster

71
Stars
82
Forks
Watchers

Terraform module to provision an Elastic MapReduce (EMR) cluster on AWS

aws-dbs-refarch-datalake

75
Stars
31
Forks
Watchers

Reference Architectures for Datalakes on AWS

pyspark-on-aws-emr

24
Stars
13
Forks
Watchers

The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on writing pyspark code.

demo-code

28
Stars
22
Forks
Watchers

Bits of code I use during live demos

Udacity-Data-Engineer-nanodegree

72
Stars
72
Forks
Watchers

Classwork projects and home works done through Udacity data engineering nano degree

terraform-emr-spark-example

43
Stars
42
Forks
Watchers

An example Terraform project that will configure a Secure and Customizable Spark Cluster on Amazon EMR.

aws-etl

15
Stars
3
Forks
Watchers

This is an ETL application on AWS with general open sales and customer data that you can find here: https://github.com/camposvinicius/data/blob/main/AdventureWorks.zip, it's a zipped file with some .c...