Rongjiehuang
FastDiff
PyTorch Implementation of FastDiff (IJCAI'22)
Multi-Singer
PyTorch Implementation of Multi-Singer (ACM-MM'21)
Multiband-WaveRNN
An unofficial implementation of the autoregressive vocoder Multiband-WaveRNN. Audio samples: https://rongjiehuang.github.io/Multiband-WaveRNN/
ProDiff
PyTorch Implementation of ProDiff (ACM-MM'22) with an extremely fast diffusion speech synthesis pipeline
GenerSpeech
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model for zero-shot style transfer of out-of-domain (OOD) custom voices.
awesome-speech-to-speech-translation
List of direct speech-to-speech translation papers.
TranSpeech
PyTorch Implementation of TranSpeech (ICLR'23): Textless NAR Speech-to-Speech Translation with Bilateral Perturbation