deltalake topic
kafka-delta-ingest
A highly efficient daemon for streaming data from Kafka into Delta Lake
delta-lake-internals
The Internals of Delta Lake
Real-time-Data-Warehouse
Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi
ApacheSpark
This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We wil...
smart-data-lake
Smart Automation Tool for building modern Data Lakes and Data Pipelines
Streamis
Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-and-drop development capability.
dbldatagen
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POC...
databricks
Databricks Platform - Architecture, Security, Automation and much more!!
101_upsert-delta
This repository exemplifies a simple ELT process using delta to perform upsert and remove data files that aren't in the latest state of the transaction log for the table.