pySpark_tutorial
pySpark_tutorial copied to clipboard
Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple DataFrames, visualization, Machine Learning
pySpark_tutorial
List of contents
- RDDs and DataFrame
- Exploratory data analysis
- Handeling multiple dataframes
- Visualization
- Machine learning