data-engineering-pipeline topic

List data-engineering-pipeline repositories

goodreads_etl_pipeline

1.2k
Stars
209
Forks
Watchers

An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.

Udacity-Data-Engineering-Projects

1.4k
Stars
464
Forks
Watchers

Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.

versatile-data-kit

413
Stars
54
Forks
Watchers

One framework to develop, deploy and operate data workflows with Python and SQL.

Movalytics-Data-Warehouse

117
Stars
27
Forks
Watchers

Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow

disaster-response-pipeline

16
Stars
13
Forks
Watchers

ETL pipeline combined with supervised learning and grid search to classify text messages sent during a disaster event

dataflow-ops

110
Stars
24
Forks
Watchers

Project demonstrating how to automate Prefect 2.0 deployments to AWS ECS Fargate

prefect-aws-lambda

35
Stars
6
Forks
Watchers

Deploy a Prefect flow to serverless AWS Lambda function

prefect-deployment-patterns

98
Stars
9
Forks
Watchers

Code examples showing flow deployment to various types of infrastructure

Udacity-Data-Engineer-nanodegree

72
Stars
72
Forks
Watchers

Classwork projects and home works done through Udacity data engineering nano degree