cleaning-data topic

List cleaning-data repositories

pyjanitor

1.3k
Stars
167
Forks
Watchers

Clean APIs for data cleaning. Python implementation of R package Janitor

clean-dialog

251
Stars
25
Forks
Watchers

A framework for cleaning Chinese dialog data

Time-series Data Preprocessing Studio in Jupyter notebook.

AutoDataCleaner

17
Stars
4
Forks
Watchers

Simple and automatic data cleaning in one line of code! It performs one-hot encoding, date & time casting to datetime dtype, detects binary columns, safely convert non-numeric columns to numeric dtype...

Area-Under-the-Margin-Ranking

21
Stars
6
Forks
Watchers

Implementation of the paper Identifying Mislabeled Data using the Area Under the Margin Ranking: https://arxiv.org/pdf/2001.10528v2.pdf

corpusexplorer2.0

20
Stars
3
Forks
Watchers

Korpuslinguistik war noch nie so einfach...

cleantext

67
Stars
11
Forks
Watchers

An open-source package for python to clean raw text data

Cleaner-Royall

60
Stars
5
Forks
Watchers

🚀 𝗔 𝗠𝗼𝘀𝘁 𝗔𝗱𝘃𝗮𝗻𝗰𝗲 𝗖𝗹𝗲𝗮𝗻𝗲𝗿 𝗙𝗼𝗿 𝗔𝗻𝗱𝗿𝗼𝗶𝗱 [Root]

GPSClean

17
Stars
0
Forks
Watchers

An application to correct a GPS trace using machine learning techniques. To preview it, a small web interface, named GPSClean Web, is available