dataset-generation topic
voc2007_for_yolo_torch
:punch: Prepare VOC format datasets for ultralytics/yolov3 & yolov5
pansori
Tools for ASR Corpus Generation from Online Video
Bamboo
Bamboo: 4 times larger than ImageNet; 2 time larger than Object365; Built by active learning.
Ransomware-Json-Dataset
Compiles a json dataset using public sources that contains properties to aid in the detection and mitigation of over 1000 variants of ransomware.
celebA-HQ-dataset-download
Get started with CelebA-HQ dataset in under 5 mins !
smart_categorizer
Trainable categorization tool
gap
Gazebo plugins for applying domain randomization
download_audioset
📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
the-youtube-scraper
Download YouTube video description and video comments without using the YouTube API.
TRScraper
TRScraper, doğal dil işleme uygulamalarında kullanılmak amacıyla geliştirilmiş, Türkçe içerik girilen büyük platformlarda metin madenciliği yapma imkanı sunan bir uygulamadır.