training-data topic
instapy-gender-classification
🔎 Classification helper for sex classification feature of InstaPy
biomedical_corpora
Table compiling the list of biomedically-related corpora available for named entity recognition (and some also suitable for association detection). First version has was published as part of the paper...
nsfw-image-urls
A repository of NSFW images to be used for machine learning/image classification purposes
hairnet-ai
Machine Learning project aimed at converting images into .obj 3D models by representing them as Blender hair-type particle systems.
COVID-19-train-audio
COVID-19 Coughs files for training AI models
ruler
Data Programming by Demonstration (DPBD) for Document Classification
AIAssistedImageVideoLabelling
AI Assisted Image and Video Training Data Labeling @ Scale
swim-ir
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 languages, generated using PaLM 2 and summarize-then-ask promptin...
Shapley_Valuation
PyTorch reimplementation of computing Shapley values via Truncated Monte Carlo sampling from "What is your data worth? Equitable Valuation of Data" by Amirata Ghorbani and James Zou [ICML 2019]
planesnet
Labeled training data for detection of aircraft in Planet satellite imagery