apachespark topic
List
apachespark repositories
hudi
5.1k
Stars
2.4k
Forks
1.2k
Watchers
Upserts, Deletes And Incremental Processing on Big Data.
ApacheSpark
82
Stars
59
Forks
Watchers
This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We wil...
cleanframes
81
Stars
8
Forks
Watchers
type-class based data cleansing library for Apache Spark SQL
SparkSQL.jl
25
Stars
0
Forks
Watchers
SparkSQL.jl enables Julia programs to work with Apache Spark data using just SQL.
data-engineer-handbook
9.4k
Stars
1.3k
Forks
224
Watchers
This is a repo with links to everything you'd ever want to learn about data engineering
docker_for_data_engineers
23
Stars
9
Forks
Watchers
Code for blog at: https://www.startdataengineering.com/post/docker-for-de/