DeepLearningExamples icon indicating copy to clipboard operation
DeepLearningExamples copied to clipboard

[FastPitch] Other language

Open Pawel-VRtechnology opened this issue 3 years ago • 3 comments

Hi everyone, I have a question about another language. Can I modify FastPitch to train other languages ​​like Polish ??

Pawel-VRtechnology avatar Oct 28 '22 07:10 Pawel-VRtechnology

Hi @Pawel-VRtechnology ,

That should be fairly straightforward. You're gonna need to adjust --text-cleaners, --symbol-set and either get ahold of a Polish pronunciation dictionary and supply it with --cmudict-path, or train on grapheme inputs with --p-arpabet=0.0.

Also, remember to set --pitch-mean and --pitch-std to your Polish speaker.

Our pre-trained WaveGlow checkpoint should work well for synthesis in Polish. For better results consider training HiFi-GAN.

alancucki avatar Oct 28 '22 14:10 alancucki

@alancucki thanks for the answer, i will try your advice :)

Pawel-VRtechnology avatar Oct 31 '22 07:10 Pawel-VRtechnology

Hi @alancucki I also train other languages like korean. But I'm going to train a multi-speaker. My question is : How to set --pitch-mean and --pitch-std? Do I have to get a mean for all the speakers' pitches?

Moon-sung-woo avatar Nov 02 '22 06:11 Moon-sung-woo

@Moon-sung-woo I was facing this problem too. The current implementation doesn't support multiple mean/std values for each speakers, but clearly that is needed in the multispeaker setting. Calculate these values for each speaker, store them in a file and load them in the DataLoader. This worked well for me.

martinvk1 avatar Feb 22 '23 09:02 martinvk1