terraform-emr-pyspark
Quickstart PySpark with Anaconda on AWS/EMR using Terraform
Terraform + EMR Bootstrap PySpark with Anaconda
This code helps you jump-start PySpark with Anaconda on AWS using Terraform.
Getting Started
- Install Terraform on Mac:
brew install terraform
- Adjust the scripts (bootstrap_actions.sh and pyspark_quick_setup.sh) in the scripts directory if necessary
- Set parameters in terraform.tfvars
- Start cluster:
terraform init
terraform apply
- Destroy cluster:
terraform destroy
Notes
- Configure AWS on your local machine:
aws configure
- AWS instance cost for eu-central-1
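The steps above assume that both the terraform and aws command-line tools are installed. As a rough pre-flight sketch, the helper below prints whichever of the given tools cannot be found on PATH. The name missing_tools is hypothetical and not part of this repository.

```shell
# Hypothetical helper (not part of this repository): print every tool
# from the argument list that is not available on PATH.
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || echo "$tool"
  done
}

# Check the two CLIs this README relies on; prints nothing if both are found.
missing_tools terraform aws
```

If this prints nothing, both tools are on PATH and the Getting Started commands (terraform init, terraform apply) can at least be invoked. Valid AWS credentials from aws configure are still required.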
Maintainers
- Dat Tran, github: datitran
Copyright
See LICENSE for details.