shravank
shravank
udacity-data-eng-proj-1
Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation, validation and loading of data from S3 -> Redshift -> S3
udacity-data-eng-proj2
A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract data from S3, apply a series of transformations and load into S3...
udacity-data-eng-proj3
Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary.
udacity-data-eng-proj4
Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as a set of dimensional tables. Lake Processing: Spark, Lake Stor...