data-pipeline topic

List data-pipeline repositories

ob_bulkstash

116
Stars
16
Forks
Watchers

Bulk Stash is a docker rclone service to sync, or copy, files between different storage services. For example, you can copy files either to or from a remote storage services like Amazon S3 to Google C...

hookah

92
Stars
9
Forks
Watchers

A cross-platform tool for data pipelines.

datajob

110
Stars
19
Forks
Watchers

Build and deploy a serverless data pipeline on AWS with no effort.

instill-core

2.1k
Stars
90
Forks
Watchers

ðŸ”Ū Instill Core is a full-stack AI infrastructure tool for data, model and pipeline orchestration, designed to streamline every aspect of building versatile AI-first applications

delta-architecture

71
Stars
16
Forks
Watchers

Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline

thedataengineeringbook

105
Stars
43
Forks
Watchers

The Data Engineering Book - āļŦāļ™āļąāļ‡āļŠāļ·āļ­āļ§āļīāļĻāļ§āļāļĢāļĢāļĄāļ‚āđ‰āļ­āļĄāļđāļĨ āļ‚āļ­āļ‡āļ„āļ™āđ„āļ—āļĒ āđ€āļžāļ·āđˆāļ­āļ„āļ™āđ„āļ—āļĒ

augraphy

308
Stars
40
Forks
Watchers

Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes

Data-Engineering-Nanodegree

75
Stars
51
Forks
Watchers

This repository holds the python files and notebooks associated with the Udacity Data Engineering Nanodegree.

ATOM

151
Stars
14
Forks
Watchers

Automated Tool for Optimized Modelling