data-curation topic
fiftyone
The open-source tool for building high-quality datasets and computer vision models
cleanlab-studio
Client interface for all things Cleanlab Studio
awesome-chemical-data
Curated list of known efforts in collecting and/or curating of chemical/materials data
Dataset-Curation-Tool
A tool for downloading from public image boards (which allow scraping) / preview your images & tags / edit your images & tags. Additional tabs for downloading other desired code repositories as well a...
CuBIDS
Curation of BIDS (CuBIDS): A sanity-preserving software package for processing BIDS datasets.
Learn2Clean
Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Cleaning
ezbids
A web service for semi-automated conversion of raw imaging data to BIDS
TopDial
Code and data for "Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation" (EMNLP 2023)
SynRBL
Rebalancing chemical reaction
NeMo-Curator
Scalable data pre processing and curation toolkit for LLMs