etl-pipeline topic
tech.ml.dataset
A Clojure high performance data processing system
etlbox
A lightweight ETL (extract, transform, load) library and data integration toolbox for .NET.
ApacheSpark
This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We wil...
dig-etl-engine
Download DIG to run on your laptop or server.
patterns-devkit
Data pipelines from re-usable components
violet_rails
an app engine for your business. Seamlessly implement business logic with a powerful API. Out of the box CMS, blog, forum and email functionality. Developer friendly & easily extendable for your next...
ethereum_analytical_db
Ethereum Analytical Database - Ethereum data access solution that can be used for analytics and application development. The solution works on a fast DB - Clickhouse.
pipebird
Pipebird is open source infrastructure for securely sharing data with customers.
data_engineering_with_python-track-datacamp
Data Engineer with Python lecture notes from #datacamp.
disaster-response-pipeline
ETL pipeline combined with supervised learning and grid search to classify text messages sent during a disaster event