etl topic
ReplicaDB
ReplicaDB is open source tool for database replication, designed for efficiently transferring bulk data between relational and non-relational databases
react-csv
React components to build CSV files on the fly basing on Array/literal object of data
monstache
a go daemon that syncs MongoDB to Elasticsearch in realtime. you know, for search.
zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
automate-dv
Hyperion pre installed on Raspberry Pi OS Lite
automate-dv
A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
kestra
:zap: Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...
mara-pipelines
A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
go-streams
A lightweight stream processing library for Go
riko
A Python stream processing engine modeled after Yahoo! Pipes
omniparser
omniparser: a native Golang ETL streaming parser and transform library for CSV, JSON, XML, EDI, text, etc.