data-pipelines topic
patterns-devkit
Data pipelines from reusable components
udacity-data-eng-proj-1
A data pipeline that automates data warehouse ETL with custom Airflow operators handling the extraction, transformation, validation, and loading of data from S3 -> Redshift -> S3
pipebird
Pipebird is open source infrastructure for securely sharing data with customers.
ml-in-production
Practical use cases showing how to make your machine learning pipelines robust and reliable using Apache Airflow.
streams-explorer
Explore Apache Kafka data pipelines in Kubernetes.
data_engineer_interview_challenges
Found a data engineering challenge or participated in a selection process? Share it with us!
fsharp-data-processing-pipeline
Provides an extensible solution for creating Data Processing Pipelines in F#.
AirflowDataPipeline
Example of an ETL Pipeline using Airflow
stepist
Framework for data processing
neon-workshop
A Pachyderm deep learning tutorial for conference workshops