data-pipeline topic

List data-pipeline repositories

:sparkles: The present project is a basic process pipeline of extrating, transforming, loading, analysing and presenting. All of that was made by using suitable tools of web scraping, data analysis/pr...

airflow-testing-ci-workflow

69
Stars
8
Forks
Watchers

(project & tutorial) dag pipeline tests + ci/cd setup

Data-Engineering-HowTo

2.4k
Stars
351
Forks
Watchers

A list of useful resources to learn Data Engineering from scratch

gusty

126
Stars
4
Forks
Watchers

Making DAG construction easier

snowplow

6.3k
Stars
1.2k
Forks
Watchers

The enterprise-grade behavioral data engine (web, mobile, server-side, webhooks), running cloud-natively on AWS and GCP

nonechucks

361
Stars
24
Forks
Watchers

Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!

elementary

845
Stars
54
Forks
Watchers

Open-source data observability for analytics engineers.

doit

1.5k
Stars
161
Forks
Watchers

task management & automation tool

kestra

3.0k
Stars
173
Forks
Watchers

Kestra is an infinitely scalable orchestration and scheduling platform, creating, running, scheduling, and monitoring millions of complex pipelines.