apachespark topic

List apachespark repositories

hudi

5.1k
Stars
2.4k
Forks
1.2k
Watchers

Upserts, Deletes And Incremental Processing on Big Data.

ApacheSpark

82
Stars
59
Forks
Watchers

This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We wil...

sparkProjectTemplate.g8

98
Stars
42
Forks
Watchers

Template for Spark Projects

cleanframes

81
Stars
8
Forks
Watchers

type-class based data cleansing library for Apache Spark SQL

SparkSQL.jl

25
Stars
0
Forks
Watchers

SparkSQL.jl enables Julia programs to work with Apache Spark data using just SQL.

data-engineer-handbook

9.4k
Stars
1.3k
Forks
224
Watchers

This is a repo with links to everything you'd ever want to learn about data engineering

docker_for_data_engineers

23
Stars
9
Forks
Watchers

Code for blog at: https://www.startdataengineering.com/post/docker-for-de/