speech-dataset topic
Speech_Feature_Extraction
Feature extraction of speech signal is the initial stage of any speech recognition system.
RNN-SM
[T-IFS] RNN-SM: Fast Steganalysis of VoIP Streams Using Recurrent Neural Network
youtube_tts_data_generator
A python library to generate speech dataset from Youtube videos
ruslan-corpus.github.io
Trigger-Word-Detection
Construct a speech dataset and implement an algorithm for trigger word detection (sometimes also called keyword detection, or wakeword detection).
EmoTa
EmoTa is an open-access Tamil Speech Emotion Recognition dataset with 936 utterances from 22 native speakers, covering five emotions (anger, happiness, sadness, fear, and neutrality). It supports emot...