data-integration topic

List data-integration repositories

airbyte

9.5k
Stars
2.3k
Forks
174
Watchers

Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.

dagster

6.0k
Stars
752
Forks
Watchers

An orchestration platform for the development, production, and observation of data assets.

awesome-single-cell

2.3k
Stars
822
Forks
Watchers

Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.

jitsu

3.0k
Stars
185
Forks
Watchers

Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days

mara-pipelines

2.0k
Stars
97
Forks
Watchers

A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow

incubator-devlake

1.8k
Stars
277
Forks
Watchers

Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and co...

kuwala

655
Stars
47
Forks
Watchers

Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as...

seatunnel

6.0k
Stars
1.2k
Forks
167
Watchers

SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).

hudi

3.5k
Stars
1.6k
Forks
Watchers

Upserts, Deletes And Incremental Processing on Big Data.

chunjun

3.4k
Stars
1.5k
Forks
Watchers

A data integration framework