data-curation topic
cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
fastdup
fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets. It helps you improve the quality of your dataset's images & labels and reduce your data oper...
metamapper
Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.
data-as-a-science
Lesson guide and textbook for "Data as a Science" course.
xtreme1
Xtreme1 is an all-in-one data labeling and annotation platform for multimodal data training that supports 3D LiDAR point clouds, images, and LLMs.
spotlight
Interactively explore unstructured datasets from your dataframe.
awesome-open-data-centric-ai
Curated list of open source tooling for data-centric AI on unstructured data.
data-centric-AI
A curated, but incomplete, list of data-centric AI resources.
sliceguard
A library for detecting problematic data segments in structured and unstructured data with a few lines of code.
docta
A Doctor for your data