etl topic

List etl repositories

ReplicaDB

371
Stars
93
Forks

ReplicaDB is an open source tool for database replication, designed for efficiently transferring bulk data between relational and non-relational databases.

react-csv

1.1k
Stars
264
Forks

React components to build CSV files on the fly, based on an array or object literal of data

monstache

1.2k
Stars
177
Forks

A Go daemon that syncs MongoDB to Elasticsearch in real time. You know, for search.

zingg

902
Stars
109
Forks

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

automate-dv

466
Stars
113
Forks

A free-to-use dbt package for creating and loading Data Vault 2.0 compliant data warehouses (powered by dbt, an open source data engineering tool and a registered trademark of dbt Labs)

kestra

14.3k
Stars
1.2k
Forks
169
Watchers

Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...

mara-pipelines

2.1k
Stars
102
Forks

A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow

go-streams

1.8k
Stars
146
Forks

A lightweight stream processing library for Go

riko

1.6k
Stars
77
Forks

A Python stream processing engine modeled after Yahoo! Pipes

omniparser

643
Stars
54
Forks

omniparser: a native Golang ETL streaming parser and transform library for CSV, JSON, XML, EDI, text, etc.