etl-pipeline topic

List etl-pipeline repositories

orchest

4.0k Stars, 250 Forks

Build data pipelines, the easy way 🛠️

greenish

15 Stars, 4 Forks

Data monitoring tool that monitors the result, not the run

A dashboard is worth a thousand words => https://datastudio.google.com/reporting/755f3183-dd44-4073-804e-9f7d3d993315

etlflow

43 Stars, 12 Forks

EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for running complex, auditable workflows that can interact with Google Cloud Platform, AWS, Kubernetes, databases, SFTP servers, O...

socialetl

40 Stars, 5 Forks

Project for "Data pipeline design patterns" blog.

dlt-with-debug

37 Stars, 7 Forks

A lightweight helper utility that lets developers do interactive pipeline development by keeping a single source for both DLT runs and non-DLT interactive notebook runs.

e2e-data-engineering

180 Stars, 83 Forks

An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All compone...

onetl

61 Stars, 6 Forks

One ETL tool to rule them all

razv-data-engineering

29 Stars, 3 Forks

Portfolio of projects and studies conducted in data engineering.

DaFlow

26 Stars, 13 Forks

Apache Spark-based data flow (ETL) framework that supports multiple read and write destinations of different types, as well as multiple categories of transformation rules.