data-ingestion topic
airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Available both self-hosted and cloud-hosted.
cuelake
Use SQL to build ELT pipelines on a data lakehouse.
broadway
Concurrent and multi-stage data ingestion and data processing with Elixir
pravega
Pravega - Streaming as a new software-defined storage primitive
squirrel-core
A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way 🌰
net.jgp.labs.spark
Apache Spark examples exclusively in Java
thedataengineeringbook
The Data Engineering Book - a data engineering book by Thai people, for Thai people
amazon-kinesis-data-processor-aws-fargate
Sample code for the AWS Big Data Blog post "Building a scalable streaming data processor with Amazon Kinesis Data Streams on AWS Fargate"
data-integration-library
The Data Integration Library project provides a library of generic components based on a multi-stage architecture for data ingress and egress.
history
Download and warehouse historical trading data