FastSpeech2 icon indicating copy to clipboard operation
FastSpeech2 copied to clipboard

Does it able to learn certain voice style?

Open lucasjinreal opened this issue 1 year ago • 3 comments

Does it able to learn certain voice style?

lucasjinreal avatar Sep 28 '22 07:09 lucasjinreal

Hi, thanks for your question. This repo doesn't support learning voice style for now. We might need a style encoder if we want to learn the voice style. Recently, instead, we have been focusing on multilingual TTS. such as supporting Chinese, Taiwanese, and so on.

ga642381 avatar Oct 04 '22 11:10 ga642381

@ga642381 hi, does multilane tts performant can compatible with single lan? isn't the phone space would be very large?

lucasjinreal avatar Oct 04 '22 12:10 lucasjinreal

I agree with you. So the collaborator of this repo, Wei-Ping Huang, does have some research on how to use self-supervised features to learn shared phonetic information across different languages. (ref: Few-Shot Cross-Lingual TTS Using Transferable Phoneme Embedding https://arxiv.org/abs/2206.15427)

As for this repo, I think at least we can support different datasets for various languages to make it more friendly for the community to do multispeaker, multilingual TTS research.

ga642381 avatar Oct 04 '22 13:10 ga642381