datalake topic
dataligo
A library to accelerate ML and ETL pipelines by connecting all data sources
Uncoder_IO
An IDE and translation engine for detection engineers and threat hunters. Be faster, write smarter, keep 100% privacy.
aws-auto-terminate-idle-emr
AWS Auto Terminate Idle AWS EMR Clusters Framework is an AWS-based solution that uses AWS CloudWatch and AWS Lambda, with a Python script built on Boto3, to terminate AWS EMR clusters that have been i...
AWS-Data-Lake
AWS Lake Formation makes it easy to set up, secure, and manage your data lakes, as well as data discovery using the metadata search capabilities of Lake Formation in the console, and metadata search...
DE-Zoomcamp
Code/Notes for the Data Engineering Zoomcamp by DataTalksClub
Roota
Roota is a public-domain language of threat detection and response that combines native queries from a SIEM, EDR, XDR, or Data Lake with standardized metadata and threat intelligence to enable automat...
openhouse
Open Control Plane for Tables in Data Lakehouse
awesome-open-source-data-engineering
A curated list of open source tools used in analytical stacks and the data engineering ecosystem
jzfs
Git-based Version Control File System for joint management of code, data, models, and their relationships.
awesome-olap
A curated list of awesome Online Analytical Processing databases, frameworks, resources and other awesomeness.