datalake topic

List datalake repositories
trafficstars

terraform-azure-data

37
Stars
30
Forks
Watchers

Terraform script to deploy almost all Azure Data Services

datapipelines-essentials-python

53
Stars
35
Forks
Watchers

Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformati...

iceberg-assembly

31
Stars
11
Forks
Watchers

汇总Apache Iceberg相关的最新文章、资料以及Demo等

apiary

35
Stars
8
Forks
Watchers

Apiary provides modules which can be combined to create a federated cloud data lake

apiary-data-lake

18
Stars
25
Forks
Watchers

Terraform scripts for deploying Apiary Data Lake

serverless-datalake-example

16
Stars
5
Forks
Watchers

A serverless datalake project and framework based on AWS S3,Glue,Athena,MWAA and QuickSight. With a series of best practices, it guides you how to build a serverless datalake.

enceladus

29
Stars
14
Forks
Watchers

Dynamic Conformance Engine

anyscale

49
Stars
10
Forks
Watchers

anyscale roadmap