Vitthal Mirji

Results: 1 repository owned by Vitthal Mirji

datapipelines-essentials-python

53 Stars
35 Forks

Simplified ETL process in Hadoop using Apache Spark. Includes a complete ETL pipeline for a data lake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformati...