data-pipeline topic
ob_bulkstash
Bulk Stash is a docker rclone service to sync, or copy, files between different storage services. For example, you can copy files either to or from a remote storage services like Amazon S3 to Google C...
hookah
A cross-platform tool for data pipelines.
datajob
Build and deploy a serverless data pipeline on AWS with no effort.
instill-core
ðŪ Instill Core is a full-stack AI infrastructure tool for data, model and pipeline orchestration, designed to streamline every aspect of building versatile AI-first applications
delta-architecture
Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline
thedataengineeringbook
The Data Engineering Book - āļŦāļāļąāļāļŠāļ·āļāļ§āļīāļĻāļ§āļāļĢāļĢāļĄāļāđāļāļĄāļđāļĨ āļāļāļāļāļāđāļāļĒ āđāļāļ·āđāļāļāļāđāļāļĒ
augraphy
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
Data-Engineering-Nanodegree
This repository holds the python files and notebooks associated with the Udacity Data Engineering Nanodegree.
ATOM
Automated Tool for Optimized Modelling
patterns-devkit
Data pipelines from re-usable components