apachespark topic

List apachespark repositories

hudi

5.1k
Stars
2.4k
Forks
Watchers

Upserts, Deletes And Incremental Processing on Big Data.

ApacheSpark

82
Stars
59
Forks
Watchers

This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We wil...

sparkProjectTemplate.g8

98
Stars
42
Forks
Watchers

Template for Spark Projects

cleanframes

81
Stars
8
Forks
Watchers

type-class based data cleansing library for Apache Spark SQL

SparkSQL.jl

25
Stars
0
Forks
Watchers

SparkSQL.jl enables Julia programs to work with Apache Spark data using just SQL.

data-engineer-handbook

37.1k
Stars
7.1k
Forks
Watchers

This is a repo with links to everything you'd ever want to learn about data engineering

docker_for_data_engineers

23
Stars
9
Forks
Watchers

Code for blog at: https://www.startdataengineering.com/post/docker-for-de/

FLiPStackWeekly

16
Stars
0
Forks
Watchers

FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...