MeloTTS icon indicating copy to clipboard operation
MeloTTS copied to clipboard

More than 1 speaker per language?

Open jidkano opened this issue 11 months ago • 7 comments

Hello, would it be possible to add a default of more than just 1 speaker per language? Maybe 3-4 (2 males, 2 females) per language would be great.

jidkano avatar Jan 20 '25 01:01 jidkano

Yes. When training a dataset like this:

path|speaker1|lang|text

So, you can do:

path|speaker1|lang1|text path|speaker2|lang1|text path|speaker3|lang1|text

yukiarimo avatar Jan 21 '25 20:01 yukiarimo

Hey, just checked. Don’t do this! If you want same speak to speak different languages - create new speaker, but use same voice but with different language. It is the best approach!

yukiarimo avatar Feb 19 '25 18:02 yukiarimo

@yukiarimo I also wanna add more speakers. You mean it is the best to do like this?

path|speaker1|lang1|text path|speaker2|lang2|text path|speaker3|lang3|text

youngandbin avatar Apr 18 '25 02:04 youngandbin

Exactly!

yukiarimo avatar Apr 18 '25 04:04 yukiarimo

@yukiarimo Hello, I'm doing finetuning to add korean speaker. I have a question. How long should I train for?

luckycontrol avatar Sep 03 '25 08:09 luckycontrol

Minimum data -> 1 hour

Good data -> 10 hours

Enough data -> 24 hours

How long? -> Generate audio each 10k steps. Stop the training whenever you like the output

yukiarimo avatar Sep 03 '25 17:09 yukiarimo

@yukiarimo Thank you very much for your reply

luckycontrol avatar Sep 03 '25 23:09 luckycontrol