How to train another language?

Open Cardroid opened this issue 3 years ago • 4 comments

I think the "DiffSinger" model is based on Chinese data. Could you give me advice on how to train it on another language? Thank you for sharing!

Cardroid avatar Feb 14 '22 06:02 Cardroid

"DiffSinger" takes phonemes, pitch, and durations as input. You need a grapheme-to-phoneme (G2P) tool, such as g2p_en for English, pypinyin for Chinese, or an equivalent for your language.
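
To illustrate the G2P step, here is a minimal sketch rather than the project's actual preprocessing code. The toy lexicon, the function names, and the "SP" word-boundary symbol are assumptions made up for this example; a real setup would use a proper pronunciation dictionary or G2P library for the target language and whatever phoneme set the training config expects. The optional g2p_en wrapper assumes that package is installed and is not called here.

```python
from typing import Callable, Dict, List

# Hypothetical toy lexicon standing in for a real pronunciation dictionary.
TOY_LEXICON: Dict[str, List[str]] = {
    "la": ["l", "a"],
    "luna": ["l", "u", "n", "a"],
}


def lexicon_g2p(word: str) -> List[str]:
    """Look up a word in the toy lexicon; fall back to one symbol per letter."""
    key = word.lower()
    return TOY_LEXICON.get(key, list(key))


def text_to_phonemes(
    text: str, g2p: Callable[[str], List[str]] = lexicon_g2p
) -> List[str]:
    """Convert whitespace-separated words into one flat phoneme list.

    "SP" is inserted between words as an assumed boundary symbol.
    """
    phonemes: List[str] = []
    for i, word in enumerate(text.split()):
        if i > 0:
            phonemes.append("SP")
        phonemes.extend(g2p(word))
    return phonemes


def load_english_g2p() -> Callable[[str], List[str]]:
    """Optional: wrap g2p_en (must be installed separately) as a word-level G2P."""
    from g2p_en import G2p  # imported lazily so the sketch runs without it

    model = G2p()
    # g2p_en returns spaces between words; drop blank tokens.
    return lambda word: [p for p in model(word) if p.strip()]


print(text_to_phonemes("la luna"))
```

Passing a different callable as `g2p` (for example the result of `load_english_g2p()`, or a function built on pypinyin's `lazy_pinyin`) swaps in another language without changing the rest of the pipeline. Pitch and duration still have to come from the dataset's annotations.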

MoonInTheRiver avatar Feb 15 '22 10:02 MoonInTheRiver

Thank you for your answer. I have another question. How do you fine-tune the vocoder for the new dataset?

Cardroid avatar Feb 16 '22 04:02 Cardroid

Can you please explain how to train DiffSinger in a different language? Does the vocoder then need to be trained from scratch, and if so, how?

morganne00 avatar Feb 19 '22 07:02 morganne00

Hi, can you please describe what needs to be done to train DiffSinger in a different language?

ktroktorin avatar Mar 12 '22 15:03 ktroktorin