dataloaders
dataloaders copied to clipboard
Pytorch and TensorFlow data loaders for several audio datasets
dataloaders
Pytorch and TFRecords data loaders for several audio datasets
Datasets
- ESC - dataset of environmental sounds
- [x] ESC Downloader
- [x] Pytorch DataSet
- [x] TFRecords Loader
- LibriSpeech - corpus of read English speech
- [x] LibriSpeech downloader for PyTorch
- [x] PyTorch DataSet
- [x] PyTorch DataSet for TFRecord
- [x] PyTorch DataLoaders for TFRecord
- [x] TFRecords Loader
- [x] TFRecords Generator
- NSynth - dataset of annotated musical notes
- [x] NSynth downloader and generator of *.h5py and *.tfrecord formats
- [x] TFRecord reader
- [x] PyTorch Dataset
- [x] PyTorch Dataset for TFrecord
- [x] PyTorch DataLoaders for TFRecord
- VoxCeleb2 - human speech, extracted from YouTube interview videos
- [ ] Pytorch loader
- [ ] TFRecords loader
- GTZAN - audio tracks from a variety of sources annotated with genre class
- [x] GTZAN Downloader
- [x] PyTorch DataSet
- CallCenter - audio tracks with human and non-human speech
- [x] PyTorch DataSet
For validation we frequently use the following scheme:
- Read 10 random crops from a file;
- Predict a class for each crop;
- Averaging results.
For this scheme we've done additional DataLoaders for PyTorch: