datalake topic
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
lakeFS
lakeFS - Data version control for your data lake | Git for data
zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
automate-dv
A free-to-use dbt package for creating and loading Data Vault 2.0 compliant data warehouses (powered by dbt, an open source data engineering tool and registered trademark of dbt Labs)
aws-orbit-workbench
A Data Platform built for AWS, powered by Kubernetes.
doris
Apache Doris is an easy-to-use, high performance and unified analytics database.
hudi
Upserts, Deletes And Incremental Processing on Big Data.
delta-lake-internals
The Internals of Delta Lake
cuelake
Use SQL to build ELT pipelines on a data lakehouse.
hudi-resources
A collection of Apache Hudi-related resources