training-data topic

List training-data repositories

diffgram

Stars: 1.8k
Forks: 118

The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.

skweak

Stars: 913
Forks: 73

skweak: A software toolkit for weak supervision applied to NLP tasks

myvision

Stars: 568
Forks: 65

Computer vision based ML training data generation tool 🚀

TagEditor

Stars: 178
Forks: 13

🏖 TagEditor - Annotation tool for spaCy

amazoncaptcha

Stars: 419
Forks: 75

Pure Python, lightweight, Pillow-based solver for Amazon's text captcha.

ydata-synthetic

Stars: 1.3k
Forks: 227

Synthetic data generators for tabular and time-series data

fountain

Stars: 117
Forks: 9

Natural Language Data Augmentation Tool for Conversational Systems

label-tool

Stars: 341
Forks: 73

Web application for image labeling and segmentation

snorkel

Stars: 5.7k
Forks: 860

A system for quickly generating training data with weak supervision

augmenty

Stars: 148
Forks: 11

Augmenty is an augmentation library based on spaCy for augmenting texts.