data-cleaning topic

List data-cleaning repositories

data-analytics-portfolio

84
Stars
22
Forks
Watchers

Portfolio of data science and data analyst projects completed by me for academic, self learning, and hobby purposes.

covid-19-data-cleanup

25
Stars
13
Forks
Watchers

Scripts to cleanup data from https://github.com/CSSEGISandData/COVID-19

DAT8

1.6k
Stars
1.1k
Forks
Watchers

General Assembly's 2015 Data Science course in Washington, DC

pandas-videos

2.1k
Stars
1.9k
Forks
Watchers

Jupyter notebook and datasets from the pandas video series

cleanlab

9.3k
Stars
722
Forks
Watchers

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

nonechucks

374
Stars
27
Forks
Watchers

Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!

miller

8.6k
Stars
202
Forks
Watchers

Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON

pandera

3.1k
Stars
282
Forks
Watchers

A light-weight, flexible, and expressive statistical data testing library

objectiv-analytics

402
Stars
24
Forks
Watchers

Powerful product analytics for data teams, with full control over data & models.

Skytrax-Data-Warehouse

132
Stars
26
Forks
Watchers

A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data vi...