apachespark topic

List apachespark repositories

hudi

4.9k Stars, 2.5k Forks, 1.2k Watchers

Upserts, Deletes And Incremental Processing on Big Data.

ApacheSpark

79 Stars, 55 Forks

This repository will help you learn Databricks concepts through examples. It will include all the important topics that we need in our real-life experience as data engineers. We wil...

sparkProjectTemplate.g8

92 Stars, 37 Forks

Template for Spark Projects

cleanframes

81 Stars, 8 Forks

Type-class-based data cleansing library for Apache Spark SQL

SparkSQL.jl

24 Stars, 0 Forks

SparkSQL.jl enables Julia programs to work with Apache Spark data using just SQL.

data-engineer-handbook

6.6k Stars, 945 Forks, 177 Watchers

This is a repo with links to everything you'd ever want to learn about data engineering