zhufeijuanjuan
zhufeijuanjuan
I modified VITS to training multi-lingual voices (english and chinese) by concat a language-specified embedding tensor **emb_lang** to text embedding **emb_t**. Everything keeps the same except the hidden channel of...
segmentation fault appears after train a few steps when batch size >16, everything is ok when batch size
Great work. I test piper-tts on my terminal device, the voice sounds good. But I met a critical issue was that the cpu usage was nearly 100% when running piper-tts...