cleaning-data topic
pyjanitor
Clean APIs for data cleaning. A Python implementation of the R package janitor.
clean-dialog
A framework for cleaning Chinese dialog data
Time-series-Preprocessing-Studio-in-Jupyter
Time-series Data Preprocessing Studio in Jupyter notebook.
AutoDataCleaner
Simple and automatic data cleaning in one line of code! It performs one-hot encoding, casts date and time columns to datetime dtype, detects binary columns, safely converts non-numeric columns to numeric dtype...
Area-Under-the-Margin-Ranking
Implementation of the paper Identifying Mislabeled Data using the Area Under the Margin Ranking: https://arxiv.org/pdf/2001.10528v2.pdf
corpusexplorer2.0
Corpus linguistics has never been so easy...
meteor-simple-schema
Meteor integration package for simpl-schema
cleantext
An open-source Python package for cleaning raw text data
Cleaner-Royall
🚀 A Most Advanced Cleaner for Android [Root]
GPSClean
An application that corrects a GPS trace using machine learning techniques. A small web interface, GPSClean Web, is available for previewing it.