data-cleaning topic
data-analytics-portfolio
Portfolio of data science and data analyst projects completed by me for academic, self learning, and hobby purposes.
covid-19-data-cleanup
Scripts to cleanup data from https://github.com/CSSEGISandData/COVID-19
DAT8
General Assembly's 2015 Data Science course in Washington, DC
pandas-videos
Jupyter notebook and datasets from the pandas video series
cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
nonechucks
Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
miller
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
pandera
A light-weight, flexible, and expressive statistical data testing library
objectiv-analytics
Powerful product analytics for data teams, with full control over data & models.
Skytrax-Data-Warehouse
A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data vi...