dataengineering topic

List dataengineering repositories

metadata-guardian

18
Stars
1
Forks
Watchers

Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️

OpenMetadata

4.3k
Stars
855
Forks
Watchers

Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.

meltano

1.6k
Stars
143
Forks
Watchers

Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

data-diff

2.9k
Stars
209
Forks
Watchers

Compare tables within or across databases

zingg

890
Stars
108
Forks
Watchers

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

chaos_genius

705
Stars
81
Forks
Watchers

ML powered analytics engine for outlier detection and root cause analysis.

dagu

1.2k
Stars
123
Forks
Watchers

Yet another cron alternative with a Web UI, but with much more capabilities. It aims to solve greater problems.

automate-dv

461
Stars
111
Forks
Watchers

Hyperion pre installed on Raspberry Pi OS Lite

automate-dv

461
Stars
111
Forks
Watchers

A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)

aws-orbit-workbench

128
Stars
26
Forks
Watchers

A Data Platform built for AWS, powered by Kubernetes.

data-engineering-interviews

56
Stars
12
Forks
Watchers

Data engineering interviews Q&A for data community by data community