deltalake topic

List deltalake repositories

kafka-delta-ingest

330
Stars
68
Forks
Watchers

A highly efficient daemon for streaming data from Kafka into Delta Lake

delta-lake-internals

177
Stars
36
Forks
Watchers

The Internals of Delta Lake

Real-time-Data-Warehouse

100
Stars
40
Forks
Watchers

Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi

ApacheSpark

82
Stars
59
Forks
Watchers

This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We wil...

smart-data-lake

100
Stars
20
Forks
Watchers

Smart Automation Tool for building modern Data Lakes and Data Pipelines

Streamis

97
Stars
42
Forks
Watchers

Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-and-drop development capability.

dbldatagen

272
Stars
53
Forks
Watchers

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POC...

databricks

46
Stars
32
Forks
Watchers

Databricks Platform - Architecture, Security, Automation and much more!!

olh

25
Stars
2
Forks
Watchers

Open source stack lakehouse

101_upsert-delta

40
Stars
5
Forks
Watchers

This repository exemplifies a simple ELT process using delta to perform upsert and remove data files that aren't in the latest state of the transaction log for the table.