training-data topic

List training-data repositories

instapy-gender-classification

Stars: 34 | Forks: 7

🔎 Classification helper for sex classification feature of InstaPy

biomedical_corpora

Stars: 18 | Forks: 4

Table compiling the list of biomedically-related corpora available for named entity recognition (and some also suitable for association detection). First version was published as part of the paper...

nsfw-image-urls

Stars: 110 | Forks: 13

A repository of NSFW images to be used for machine learning/image classification purposes

hairnet-ai

Stars: 25 | Forks: 0

Machine Learning project aimed at converting images into .obj 3D models by representing them as Blender hair-type particle systems.

COVID-19-train-audio

Stars: 41 | Forks: 16

COVID-19 cough audio files for training AI models

ruler

Stars: 35 | Forks: 6

Data Programming by Demonstration (DPBD) for Document Classification

AIAssistedImageVideoLabelling

Stars: 21 | Forks: 7

AI Assisted Image and Video Training Data Labeling @ Scale

swim-ir

Stars: 41 | Forks: 2

SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 languages, generated using PaLM 2 and summarize-then-ask promptin...

Shapley_Valuation

Stars: 23 | Forks: 5

PyTorch reimplementation of computing Shapley values via Truncated Monte Carlo sampling from "What is your data worth? Equitable Valuation of Data" by Amirata Ghorbani and James Zou [ICML 2019]

planesnet

Stars: 30 | Forks: 8

Labeled training data for detection of aircraft in Planet satellite imagery