data-engineering topic

List data-engineering repositories

data-diff

2.9k
Stars
240
Forks

Compare tables within or across databases

dagu

1.2k
Stars
126
Forks

Yet another cron alternative with a Web UI, but with many more capabilities. It aims to solve bigger problems.

kestra

20.9k
Stars
1.8k
Forks
141
Watchers

⚡ Universal Workflow Orchestration Platform: code in any language, run anywhere. 800+ plugins for data, infrastructure, and AI automation.

ploomber

3.4k
Stars
230
Forks

The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️

Udacity-Data-Engineering-Projects

1.4k
Stars
464
Forks

A few projects related to Data Engineering, including Data Modeling, Infrastructure setup on cloud, Data Warehousing, and Data Lake development.

great_expectations

9.6k
Stars
1.5k
Forks
69
Watchers

Always know what to expect from your data.

incubator-devlake

2.5k
Stars
480
Forks

Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and co...

Cookbook

13.2k
Stars
2.4k
Forks

The Data Engineering Cookbook

Skytrax-Data-Warehouse

132
Stars
26
Forks

A full data warehouse infrastructure with ETL pipelines running inside Docker on Apache Airflow for data orchestration, AWS Redshift as the cloud data warehouse, and Metabase to serve the needs of data vi...

datart

1.8k
Stars
551
Forks

Datart is a next-generation open platform for data visualization.