data-pipeline topic
data-science-on-gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
scalable-data-science-platform
Content for architecting a data science platform for products using Luigi, Spark & Flask.
tributary
Streaming reactive and dataflow graphs in Python
mobydq
:whale: Tool to automate data quality checks on data pipelines
flupy
Fluent data pipelines for python and your shell
memphis
Memphis.dev is a highly scalable and effortless data streaming platform
watchmen-matryoshka-doll
Watchmen Platform is a low code data platform for data pipeline, meta data management , analysis, and quality management
scicloj.ml
A Clojure machine learning library
piperider
Code review for data in dbt
glue-public
:fire: Data pipeline and automation tool.