data-lakehouse topic

List data-lakehouse repositories

qbeast-spark

Stars: 199, Forks: 17

Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!

Local-Data-LakeHouse

Stars: 45, Forks: 8

Sample data lakehouse deployed in Docker containers using Apache Iceberg, MinIO, Trino, and a Hive Metastore. Intended for local testing.

awesome-open-source-data-engineering

Stars: 66, Forks: 9

A curated list of open-source tools used in analytical stacks and the data engineering ecosystem.