TTS [Feature request] pronunciation, cadence and nuances in XTTS v2...

Open 0wwafa opened this issue 8 months ago • 6 comments

Hello! I have used XTTS v2 for a while and made great voices. I would like to know one thing: every voice I make, when it "speaks", has the same cadence and pronunciation (clearly coming from the trained model). How can I capture those from the speaker as well? To really clone a voice, you need not only its frequencies but also its nuances. Could you please post an example or, even better, add the feature directly to XTTS v2, so that one can choose between a standard voice, a "speaker" voice, or a speaker voice with "nuance"? That would be great! Thanks.

May 29 '24 18:05 0wwafa
