datacleansing topic
datacleansing repositories
OpenRefine (10.6k stars, 1.9k forks)
OpenRefine is a free, open-source power tool for working with messy data and improving it.
SparkClean (26 stars, 6 forks)
A Scalable Data Cleaning Library for PySpark.
table_enforcer (17 stars, 1 fork)
Table Enforcer is my attempt to apply a sort of "test-driven development" workflow to data cleaning and validation. A Python package to facilitate the iterative process of developing and using schema-...
Data-Visualisations-using-Power-BI (17 stars, 7 forks)
Data visualisations in Power BI