data-pipelines topic

List data-pipelines repositories

dagger

12
Stars
4
Forks

Define sophisticated data pipelines with Python and run them on different distributed systems (such as Argo Workflows).

dud

166
Stars
6
Forks

A lightweight CLI tool for versioning data alongside source code and building data pipelines.

orchest

4.0k
Stars
250
Forks

Build data pipelines, the easy way 🛠️

dagster

10.4k
Stars
1.3k
Forks

An orchestration platform for the development, production, and observation of data assets.

elementary

1.8k
Stars
144
Forks

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

optimus

737
Stars
153
Forks

Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

meltano

1.6k
Stars
143
Forks

Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

deeplake

7.8k
Stars
596
Forks

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop....

versatile-data-kit

411
Stars
55
Forks

One framework to develop, deploy and operate data workflows with Python and SQL.

dataform

794
Stars
146
Forks

Dataform is a framework for managing SQL-based data operations in BigQuery.