dataengineering topic

List dataengineering repositories

metadata-guardian

18
Stars
1
Forks
Watchers

Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️

OpenMetadata

5.4k
Stars
1.0k
Forks
Watchers

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...

meltano

1.6k
Stars
145
Forks
Watchers

Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

data-diff

2.9k
Stars
240
Forks
Watchers

Compare tables within or across databases

zingg

902
Stars
109
Forks
Watchers

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

chaos_genius

707
Stars
81
Forks
Watchers

ML powered analytics engine for outlier detection and root cause analysis.

dagu

1.2k
Stars
126
Forks
Watchers

Yet another cron alternative with a Web UI, but with much more capabilities. It aims to solve greater problems.

automate-dv

466
Stars
113
Forks
Watchers

Hyperion pre installed on Raspberry Pi OS Lite

automate-dv

466
Stars
113
Forks
Watchers

A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)

aws-orbit-workbench

128
Stars
26
Forks
Watchers

A Data Platform built for AWS, powered by Kubernetes.

data-engineering-interviews

63
Stars
13
Forks
Watchers

Data engineering interviews Q&A for data community by data community