data-lakehouse topic
List
data-lakehouse repositories
qbeast-spark
199
Stars
17
Forks
Watchers
Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!
Local-Data-LakeHouse
45
Stars
8
Forks
Watchers
Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.
awesome-open-source-data-engineering
66
Stars
9
Forks
Watchers
A curated list of open source tools used in analytical stacks and data engineering ecosystem