Aaron (Yinghao) Li

Results 9 repositories owned by Aaron (Yinghao) Li

StarGANv2-VC

464
Stars
110
Forks
Watchers

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

AuxiliaryASR

103
Stars
29
Forks
Watchers

Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)

PitchExtractor

106
Stars
25
Forks
Watchers

Deep Neural Pitch Extractor for Voice Conversion and TTS Training

StyleTTS

358
Stars
58
Forks
Watchers

Official Implementation of StyleTTS

StyleTTS2

4.2k
Stars
323
Forks
Watchers

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

PL-BERT

191
Stars
31
Forks
Watchers

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

HiFTNet

109
Stars
11
Forks
Watchers

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform

SLMGAN

15
Stars
0
Forks
Watchers

SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs