lakehouse topic
doris
Apache Doris is an easy-to-use, high performance and unified analytics database.
presto
The official home of the Presto distributed SQL query engine for big data
starrocks
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries.
lakehouse-engine
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Prod...
lhbench
Lakehouse storage system benchmark
ytsaurus
YTsaurus is a scalable and fault-tolerant open-source big data platform.
gravitino
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
terraform-databricks-examples
Examples of using Terraform to deploy Databricks resources
Local-Data-LakeHouse
Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.
ByConity
ByConity is an open source cloud data warehouse