audio-datasets icon indicating copy to clipboard operation
audio-datasets copied to clipboard

open-source audio datasets

Results 18 audio-datasets issues
Sort by recently updated
recently updated
newest added

The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems This dataset is built for the purpose of emotional speech synthesis. The transcript were based on the...

CREMA-D CREMA-D is a data set of 7,442 original clips from 91 actors. These clips were from 48 male and 43 female actors between the ages of 20 and 74...

ANAD - Arabic Natural Audio Dataset --- - [X] Added DataSet info - [X] Added DVC - [X] Added tags on DAGsHub - [X] Added License information - [X] Dataset...

About: Extract Runescape classic sounds from cache to wav (and vice versa). Jagex used Sun's original .au sound format, which is headerless, 8-bit, u-law encoded, 8000 Hz pcm samples. this...

# Microsoft Scalable Noisy Speech Dataset [MS-SNSD](https://github.com/microsoft/MS-SNSD) About: * This dataset contains a large collection of clean speech files and variety of environmental noise files in .wav format sampled at...

**Claim Dataset:** [Flickr Audio Caption](https://groups.csail.mit.edu/sls/downloads/flickraudio/) **About Dataset:** The Flickr 8k Audio Caption Corpus contains 40,000 spoken captions of 8,000 natural images. It was collected in 2015 to investigate multimodal learning...

**Claim Dataset:** [CMU-MOSI](https://www.amir-zadeh.com/datasets) **About Dataset:** CMU Multimodal Opinion Sentiment Intensity (CMU-MOSI) is a dataset of opinion level sentiment intensity in online videos. The CMU-MOSI dataset opened the door to utterance...

Hi there, After reading through some documentation, I noticed a few spelling and punctuation errors. Glad I could contribute!