speech-processing topic
hifigan-denoiser
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
ttslearn
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
tfg-voice-conversion
Deep Learning-based Voice Conversion system
MultiBench
[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning
Speech_Signal_Processing_and_Classification
Front-end speech processing aims at extracting proper features from short-term segments of a speech utterance, known as frames. It is a pre-requisite step toward any pattern recognition problem emplo...
SPTK
A suite of speech signal processing tools
VQ-VAE-Speech
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
react-native-dialogflow
A React-Native Bridge for the Google Dialogflow (API.AI) SDK
ZZZ-RETIRED__openstt
RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
pytorch-kaldi-neural-speaker-embeddings
Lightweight neural speaker embedding extraction based on Kaldi and PyTorch.