TTS
[Feature request] pronunciation, cadence and nuances in XTTS v2...
Hello! I have used XTTS v2 for a while and made great voices. I wish to know one thing: every voice I make, when it "speaks", has the same cadence and pronunciation (clearly coming from the trained model). How could I also capture those from the speaker? I mean, to really clone a voice, you need not only its frequencies but also its nuances. Could you please post an example or, even better, add the feature directly to XTTS v2, so that one can choose between a standard voice, a "speaker" voice, or a speaker voice with "nuance"? That would be great! Thanks.