data-cleansing topic
optimus
:truck: Agile Data Preparation Workflows madeย easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
data-forge-ts
The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
PClean
A domain-specific probabilistic programming language for scalable Bayesian data cleaning
data-analysis-using-python
Exploratory data analysis ๐using python ๐of used car ๐ database taken from โ๐๐๐๐๐
wrangler
Wrangler Transform: A DMD system for transforming Big Data
dedupe
Java DSL for (online) deduplication
Cousera_Google-Data-Analytics-Professional-Certificate
Quizzes & Assignment Solutions for Google Data Analytics Professional Certificate on Coursera. Also included a few resources on side that I found helpful.
desbordante-core
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algor...
Autism-Detection-in-Adults
This is a binary classification problem related with Autistic Spectrum Disorder (ASD) screening in Adult individual. Given some attributes of a person, my model can predict whether the person would ha...
Zillow-Home-Value-Prediction
XGBoost, LightGBM, LSTM, Linear Regression, Exploratory Data Analysis