datalake topic

List datalake repositories

dataligo

Stars: 47, Forks: 3

A library to accelerate ML and ETL pipelines by connecting all data sources

Uncoder_IO

Stars: 126, Forks: 22

An IDE and translation engine for detection engineers and threat hunters. Be faster, write smarter, keep 100% privacy.

aws-auto-terminate-idle-emr

Stars: 26, Forks: 16

AWS Auto Terminate Idle AWS EMR Clusters Framework is an AWS-based solution that uses AWS CloudWatch and AWS Lambda, with a Python script built on Boto3, to terminate AWS EMR clusters that have been i...

AWS-Data-Lake

Stars: 16, Forks: 3

AWS Lake Formation makes it easy to set up, secure, and manage your data lakes. It also supports data discovery using the metadata search capabilities of Lake Formation in the console, and metadata search...

DE-Zoomcamp

Stars: 25, Forks: 5

Code/Notes for the Data Engineering Zoomcamp by DataTalksClub

Roota

Stars: 114, Forks: 8

Roota is a public-domain language of threat detection and response that combines native queries from a SIEM, EDR, XDR, or Data Lake with standardized metadata and threat intelligence to enable automat...

openhouse

Stars: 305, Forks: 51

Open Control Plane for Tables in Data Lakehouse

awesome-open-source-data-engineering

Stars: 66, Forks: 9

A curated list of open source tools used in analytical stacks and the data engineering ecosystem

jzfs

Stars: 109, Forks: 11

Git-based Version Control File System for joint management of code, data, models, and their relationships.

awesome-olap

Stars: 23, Forks: 2

A curated list of awesome Online Analytical Processing databases, frameworks, resources and other awesomeness.