data-pipeline topic

List data-pipeline repositories

data-science-on-gcp

1.3k
Stars
709
Forks
Watchers

Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017

scalable-data-science-platform

163
Stars
29
Forks
Watchers

Content for architecting a data science platform for products using Luigi, Spark & Flask.

tributary

428
Stars
39
Forks
Watchers

Streaming reactive and dataflow graphs in Python

mobydq

245
Stars
59
Forks
Watchers

:whale: Tool to automate data quality checks on data pipelines

flupy

189
Stars
15
Forks
Watchers

Fluent data pipelines for python and your shell

memphis

3.2k
Stars
211
Forks
21
Watchers

Memphis.dev is a highly scalable and effortless data streaming platform

watchmen-matryoshka-doll

131
Stars
21
Forks
Watchers

Watchmen Platform is a low code data platform for data pipeline, meta data management , analysis, and quality management

scicloj.ml

200
Stars
13
Forks
Watchers

A Clojure machine learning library

glue-public

112
Stars
4
Forks
Watchers

:fire: Data pipeline and automation tool.