
How about translation to other languages?

Open psycalc opened this issue 3 years ago • 3 comments

Please guide me: which kind of neural network (neural framework) do you use, and why? In which direction should I look in order to make voices more realistic and able to speak other languages? Is it possible at all, or is it very complex and hard to train the network?

psycalc avatar May 22 '21 18:05 psycalc

https://becominghuman.ai/generating-neural-speech-synthesis-voice-acting-using-xvasynth-fc978fdf24c1 Sorry, I found it myself.

psycalc avatar May 22 '21 18:05 psycalc

v3 now supports multiple languages. A voice trained in English can also speak another language to some extent, though more monotonously.

Pendrokar avatar Jul 14 '23 12:07 Pendrokar

That article is super old. The v3 model now uses a slightly custom-tweaked VITS/YourTTS model. The tweaks include larger capacity, a bigger language embedding, a custom symbol set (a custom spec of ARPAbet with some more phonemes to cover other languages), and I guess a different training script.

DanRuta avatar Jul 15 '23 08:07 DanRuta
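
For readers wondering what "ARPAbet with some more phonemes" plus a language embedding might look like on the input side, here is a rough sketch. It is not taken from xVASynth: the extra symbols (`RR`, `UE`, `KH`), the pad symbol, the `LANG_TO_ID` table, and the `encode` helper are all made up for illustration. Only the 39 base phonemes are the standard ARPAbet inventory used by CMUdict.

```python
# Standard 39-phoneme ARPAbet inventory (CMUdict style, stress marks omitted).
ARPABET = (
    "AA AE AH AO AW AY B CH D DH EH ER EY F G HH IH IY JH K L M N NG "
    "OW OY P R S SH T TH UH UW V W Y Z ZH"
).split()

# Hypothetical extra symbols for sounds English lacks.
# These are illustrative only, NOT xVASynth's actual symbol set.
EXTRA = ["RR", "UE", "KH"]

PAD = "_"  # assumed padding symbol at index 0
SYMBOLS = [PAD] + ARPABET + EXTRA
SYMBOL_TO_ID = {symbol: index for index, symbol in enumerate(SYMBOLS)}

# Hypothetical language table; the ID would index a language-embedding layer.
LANG_TO_ID = {"en": 0, "de": 1, "es": 2}


def encode(phonemes, lang):
    """Map a phoneme sequence and a language code to integer model inputs."""
    unknown = [p for p in phonemes if p not in SYMBOL_TO_ID]
    if unknown:
        raise ValueError(f"unknown symbols: {unknown}")
    return [SYMBOL_TO_ID[p] for p in phonemes], LANG_TO_ID[lang]


if __name__ == "__main__":
    # "hello" in ARPAbet, tagged as English.
    print(encode(["HH", "AH", "L", "OW"], "en"))
```

The point is just that one shared symbol table serves every language, and the language ID is passed alongside the phoneme IDs so the model can condition on it. That would fit with an English-trained voice being able to attempt other languages.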