aws-emr topic
sparksteps
:star: CLI tool to launch Spark jobs on AWS EMR
spark_python_ml_examples
Spark 2.0 Python Machine Learning examples
spark_scala_ml_examples
Spark 2.0 Scala Machine Learning examples
pyspark-on-aws-emr
The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on writing pyspark code.
demo-code
Bits of code I use during live demos
rheoceros
Cloud-based AI / ML workflow and data application development framework
aws-auto-terminate-idle-emr
AWS Auto Terminate Idle AWS EMR Clusters Framework is an AWS based solution using AWS CloudWatch and AWS Lambda using a Python script that is using Boto3 to terminate AWS EMR clusters that have been i...
aws-data-pipeline
A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from locally hosted Airflow containers. The end product is a Superset d...
terraform-aws-emr
Terraform module to create AWS EMR resources 🇺🇦