data-engineering-pipeline topic
goodreads_etl_pipeline
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Udacity-Data-Engineering-Projects
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
versatile-data-kit
One framework to develop, deploy and operate data workflows with Python and SQL.
Movalytics-Data-Warehouse
Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow
disaster-response-pipeline
ETL pipeline combined with supervised learning and grid search to classify text messages sent during a disaster event
dataflow-ops
Project demonstrating how to automate Prefect 2.0 deployments to AWS ECS Fargate
prefect-aws-lambda
Deploy a Prefect flow to serverless AWS Lambda function
prefect-deployment-patterns
Code examples showing flow deployment to various types of infrastructure
Udacity-Data-Engineer-nanodegree
Classwork projects and home works done through Udacity data engineering nano degree
Apache-Spark-Guide
Apache Spark Guide