apachespark topic
hudi
Upserts, Deletes And Incremental Processing on Big Data.
ApacheSpark
This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We wil...
cleanframes
type-class based data cleansing library for Apache Spark SQL
SparkSQL.jl
SparkSQL.jl enables Julia programs to work with Apache Spark data using just SQL.
data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
docker_for_data_engineers
Code for blog at: https://www.startdataengineering.com/post/docker-for-de/
FLiPStackWeekly
FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...