Ramses Alexander Coraspe Valdez
Ramses Alexander Coraspe Valdez
apache-spark-docker
Dockerizing an Apache Spark Standalone Cluster
data-engineer-challenge
Challenge Data Engineer
pyspark-on-aws-emr
The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on writing pyspark code.
uber-expenses-tracking
The goal of this project is to track the expenses of Uber Rides and Uber Eats through data Engineering processes using technologies such as Apache Airflow, AWS Redshift and Power BI.
csv-schema-inference
A tool to automatically infer columns data types in .csv files
pyDag
Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag
Dropout-Students-Prediction
The goal of this project is to identify students at risk of dropping out the school