amazon-emr topic

List amazon-emr repositories

aws-dbs-refarch-datalake

Stars: 75, Forks: 31

Reference Architectures for Datalakes on AWS

demo-code

Stars: 28, Forks: 22

Bits of code I use during live demos

modern-data-lake-storage-layers

Stars: 44, Forks: 27

Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work

dataflow-runner

Stars: 19, Forks: 8

Run templatable playbooks of Hadoop, Spark, and other jobs on Amazon EMR

aws-airflow-demo

Stars: 42, Forks: 14

Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for Apache Airflow (MWAA) on AWS.

emr-demo

Stars: 38, Forks: 17

Project files for the post: Running PySpark Applications on Amazon EMR: Methods for Interacting with PySpark on Amazon Elastic MapReduce.

terraform-emr-spark-example

Stars: 43, Forks: 42

An example Terraform project that will configure a Secure and Customizable Spark Cluster on Amazon EMR.

amazon-emr-with-delta-lake

Stars: 17, Forks: 12

Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR

amazon-emr-cli

Stars: 33, Forks: 10

A command-line interface for packaging, deploying, and running your EMR Serverless Spark jobs

amazon-emr-vscode-toolkit

Stars: 28, Forks: 3

A VS Code Extension to make it easier to manage and develop Spark jobs on EMR