dataengineering topic

List dataengineering repositories

metadata-guardian

18
Stars
1
Forks
Watchers

Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️

OpenMetadata

8.3k
Stars
1.6k
Forks
8.3k
Watchers

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...

meltano

1.6k
Stars
145
Forks
Watchers

Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

data-diff

2.9k
Stars
240
Forks
Watchers

Compare tables within or across databases

zingg

902
Stars
109
Forks
Watchers

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

chaos_genius

707
Stars
81
Forks
Watchers

ML powered analytics engine for outlier detection and root cause analysis.

dagu

3.0k
Stars
223
Forks
3.0k
Watchers

A self-contained, lightweight workflow engine with a built-in Web UI. Define workflows in a simple, declarative YAML format. Execute them anywhere, compose complex pipelines, and distribute tasks. Zer...

automate-dv

466
Stars
113
Forks
Watchers

Hyperion pre installed on Raspberry Pi OS Lite

automate-dv

466
Stars
113
Forks
Watchers

A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)

aws-orbit-workbench

128
Stars
26
Forks
Watchers

A Data Platform built for AWS, powered by Kubernetes.

data-engineering-interviews

63
Stars
13
Forks
Watchers

Data engineering interviews Q&A for data community by data community