audiomate icon indicating copy to clipboard operation
audiomate copied to clipboard

Dataset Collection

Open ynop opened this issue 7 years ago • 2 comments

Just a list to collect available audio datasets.

Collections

  • http://www.audiocontentanalysis.org/data-sets/
  • https://github.com/JRMeyer/open-speech-corpora
  • http://www.openslr.org/resources.php
  • https://github.com/jim-schwoebel/voice_datasets

Varia:

  • [ ] AVA Speech (http://research.google.com/ava/download.html#ava_speech_download)

Environmental Sounds:

  • [x] https://github.com/karoldvl/ESC-50
  • [x] http://urbansounddataset.weebly.com/urbansound8k.html
  • [x] https://data.vision.ee.ethz.ch/cvl/ae_dataset/
  • [ ] Freesound Dataset (https://datasets.freesound.org)
  • [ ] MIVIA (http://mivia.unisa.it/datasets/audio-analysis/mivia-audio-events/)
  • [ ] DESED (https://github.com/turpaultn/DESED)

Music:

  • [ ] http://mirg.city.ac.uk/codeapps/the-magnatagatune-dataset
  • [ ] https://github.com/mdeff/fma
  • [ ] https://sigsep.github.io/musdb
  • [ ] http://medleydb.weebly.com/
  • [ ] http://mtg.upf.edu/ismir2004/contest/tempoContest/node5.html
  • [ ] DALI: https://github.com/gabolsgabs/DALI

Speech:

  • [x] TIMIT: https://github.com/philipperemy/timit
  • [ ] TED-lium: http://www-lium.univ-lemans.fr/en/content/ted-lium-corpus
  • [x] TUDA: https://www.inf.uni-hamburg.de/en/inst/ab/lt/resources/data/acoustic-models.html
  • [x] LibriSpeech: http://www.openslr.org/12/
  • [ ] AMI Corpus: http://groups.inf.ed.ac.uk/ami/corpus/
  • [ ] SIWIS: http://www.unige.ch/lettres/linguistique/research/latl/siwis/
  • [ ] Buckeye: https://buckeyecorpus.osu.edu/
  • [ ] CSTR VCTK: http://homepages.inf.ed.ac.uk/jyamagis/page3/page58/page58.html
  • [x] Mozilla Common Voice: https://voice.mozilla.org/en/data
  • [x] Free Spoken Digit Set: https://github.com/Jakobovski/free-spoken-digit-dataset
  • [ ] Vystadial 2013: https://lindat.mff.cuni.cz/repository/xmlui/handle/11858/00-097C-0000-0023-4671-4
  • [x] https://tatoeba.org/eng
  • [x] The Spoken Wikipedia Corpora (https://nats.gitlab.io/swc/)
  • [ ] QUESST-2014 (http://speech.fit.vutbr.cz/software/quesst-2014-multilingual-database-query-by-example-keyword-spotting)
  • [ ] https://github.com/nicolaspanel/TrainingSpeech
  • [ ] SWS2013 (http://speech.fit.vutbr.cz/software/sws-2013-multilingual-database-query-by-example-keyword-spotting)
  • [ ] CSS10 (https://github.com/Kyubyong/css10)
  • [ ] CMU Wilderness(https://github.com/festvox/datasets-CMU_Wilderness)
  • [x] M-AILABS (http://www.m-ailabs.bayern/en/the-mailabs-speech-dataset/)
  • [x] AudioMNIST (https://github.com/soerenab/AudioMNIST)
  • [ ] COSINE (http://melodi.ee.washington.edu/cosine/)
  • [x] Fluent Speech Commands (http://www.fluent.ai/research/fluent-speech-commands/)
  • [ ] Scotus Speech (https://github.com/noajshu/scotus-speech)
  • [ ] Living Audio Dataset (https://github.com/Idlak/Living-Audio-Dataset)
  • [ ] VOiCES (https://voices18.github.io/downloads/)
  • [ ] EUSpeech (https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/GKABNU)
  • [ ] LibriVoxDeEn (https://www.cl.uni-heidelberg.de/statnlpgroup/librivoxdeen/#beilharz19)

ynop avatar Jan 19 '18 16:01 ynop

Hello, I am new to the gitlab collaborating world and coding. Very eager to help out and gain some experience. How can I help out with this issue?

emzee831 avatar Oct 10 '19 22:10 emzee831

Hi, if you like you can implement a reader for another dataset/corpus from the list above or any other one you know.

You can find examples of existing implementations in https://github.com/ynop/audiomate/tree/master/audiomate/corpus/io.

There is some documentation on adding new datasets as well: https://audiomate.readthedocs.io/en/latest/documentation/new_dataset_format.html

This one (http://www.openslr.org/1/) for example looks like a simple one to start. Let me know if you need any help.

If you start, please open a new issue for further discussions.

ynop avatar Oct 10 '19 22:10 ynop