data-engineering-nanodegree
Notebooks produced throughout Udacity's Data Engineering Nanodegree course.
Data Engineering Nanodegree
You can find out more about the nanodegree program here: https://www.udacity.com/course/data-engineer-nanodegree--nd027
Purpose of this repository
This repository contains all the exercise notebooks I completed throughout the nanodegree courses.
The projects developed during each course are listed below.
Course Projects
1. Data Modeling Course
- Project 1: Data Modeling with Postgres: Sparkify song play logs ETL process
- Project 2: Data Modeling with Apache Cassandra: Sparkify song play logs ETL process
2. Cloud Data Warehouses
3. Data Lakes with Spark
- Project 4: Sparkify's Data Lake ELT process
4. Data Pipelines with Airflow
- Project 5: Sparkify's Event Logs Data Pipeline
5. Capstone Project
- Work around the world: a simple and unified dataset with jobs from major tech job lists