datacleaning topic
kaggle-with-R
All kaggle datasets and the R codes
HyperGBM
A full pipeline AutoML tool for tabular data
Twitter-Sentiment-Analysis
It is a Natural Language Processing Problem where Sentiment Analysis is done by Classifying the Positive tweets from negative tweets by machine learning models for classification, text mining, text a...
great_expectations
Always know what to expect from your data.
OpenRefine
OpenRefine is a free, open source power tool for working with messy data and improving it
dataprep
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
covid_19_jhu_data_web_scrap_and_cleaning
This repository contains data and code used to get and clean data from https://github.com/CSSEGISandData/COVID-19 and https://www.worldometers.info/coronavirus/
covid-19-india-data
data and code for scrapping and cleaning data on covid-19 in India from https://www.mohfw.gov.in/ and https://www.covid19india.org/
amora-data-build-tool
Amora Data Build Tool enables analysts and engineers to transform data on the data warehouse (BigQuery) by writing Amora Models that describe the data schema using Python's "PEP484 - Type Hints" and s...
validatedb
Validate on a table in a DB, using dbplyr