emr-bootstrap-pyspark
Quickstart PySpark with Anaconda on AWS/EMR
EMR Bootstrap PySpark with Anaconda
This code helps you jump-start PySpark with Anaconda on AWS EMR.
Getting Started
- Create the conda environment:
conda env create -f environment.yml
- Fill in all the required information (e.g. AWS access key, secret access key) in the config.yml.example file and rename it to config.yml
- Run it:
python emr_loader.py
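The configuration step above can be partly automated. The following is a minimal sketch, not part of this repository: the helper names prepare_config and find_unfilled_keys are hypothetical, and the scan assumes config.yml uses plain "key: value" lines, possibly nested by indentation. It uses only the standard library and does a rough line scan rather than real YAML parsing, so treat its output as a hint.

```python
import shutil
from pathlib import Path


def prepare_config(example="config.yml.example", target="config.yml"):
    """Copy the example config to the target path unless the target exists.

    Returns True if a new file was created, False if one was already there.
    (Hypothetical helper; the repository itself only asks you to rename the file.)
    """
    example_path, target_path = Path(example), Path(target)
    if target_path.exists():
        return False  # never overwrite a config that may hold real credentials
    if not example_path.exists():
        raise FileNotFoundError(f"{example_path} not found")
    shutil.copy(example_path, target_path)
    return True


def find_unfilled_keys(path="config.yml"):
    """Return keys whose value is still empty.

    Assumption: simple 'key: value' lines. This is a line scan, not a YAML parser.
    """
    # Ignore blank lines and full-line comments.
    lines = [
        line for line in Path(path).read_text().splitlines()
        if line.strip() and not line.strip().startswith("#")
    ]
    unfilled = []
    for i, line in enumerate(lines):
        if ":" not in line:
            continue
        key, _, value = line.strip().partition(":")
        value = value.split(" #", 1)[0].strip()  # drop a trailing inline comment
        if value not in ("", "''", '""'):
            continue  # the key already has a value
        # A key followed by a more-indented line is a section header, not a gap.
        indent = len(line) - len(line.lstrip())
        following = lines[i + 1] if i + 1 < len(lines) else ""
        if following and len(following) - len(following.lstrip()) > indent:
            continue
        unfilled.append(key.strip())
    return unfilled
```

Calling prepare_config() and then find_unfilled_keys() from the repository root before running emr_loader.py would list any entries that still need a value. Copying instead of renaming keeps config.yml.example available as a reference.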
Requirements
Copyright
See LICENSE for details. Copyright (c) 2016 Dat Tran.