speech-processing topic
TDNN
Time delay neural network (TDNN) implementation in Pytorch using unfold method
awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
MevonAI-Speech-Emotion-Recognition
Identify the emotion of multiple speakers in an Audio Segment
CleanUNet
Official PyTorch Implementation of CleanUNet (ICASSP 2022)
IMS-Toucan
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
formant-analyzer
iOS application for finding formants in spoken sounds
GCommandsPytorch
ConvNets for Audio Recognition using Google Commands Dataset
CNN-VAD
A Convolutional Neural Network based Voice Activity Detector for Smartphones
DNC
Discriminative Neural Clustering for Speaker Diarisation