data-integration topic

List data-integration repositories

airbyte

14.5k
Stars
3.7k
Forks
174
Watchers

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

dagster

11.9k
Stars
1.5k
Forks
123
Watchers

An orchestration platform for the development, production, and observation of data assets.

awesome-single-cell

2.9k
Stars
929
Forks
Watchers

Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.

jitsu

3.9k
Stars
270
Forks
Watchers

Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days

mara-pipelines

2.1k
Stars
102
Forks
Watchers

A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow

incubator-devlake

2.5k
Stars
480
Forks
Watchers

Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and co...

kuwala

771
Stars
52
Forks
Watchers

Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as...

seatunnel

7.5k
Stars
1.6k
Forks
172
Watchers

SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.

hudi

5.1k
Stars
2.4k
Forks
1.2k
Watchers

Upserts, Deletes And Incremental Processing on Big Data.

chunjun

3.9k
Stars
1.7k
Forks
Watchers

A data integration framework