FastSpeech2: Can it learn a certain voice style?

Open lucasjinreal opened this issue 1 year ago • 3 comments

Can it learn a certain voice style?

Sep 28 '22 07:09 lucasjinreal

Hi, thanks for your question. This repo doesn't support learning voice style for now. We might need a style encoder if we want to learn the voice style. Instead, we have recently been focusing on multilingual TTS, such as supporting Chinese, Taiwanese, and so on.

Oct 04 '22 11:10 ga642381

@ga642381 Hi, can multilingual TTS perform comparably to single-language TTS? Wouldn't the phone space be very large?

Oct 04 '22 12:10 lucasjinreal

I agree with you. The collaborator of this repo, Wei-Ping Huang, has done some research on how to use self-supervised features to learn shared phonetic information across different languages (ref: Few-Shot Cross-Lingual TTS Using Transferable Phoneme Embedding, https://arxiv.org/abs/2206.15427).

As for this repo, I think at least we can support different datasets for various languages to make it more friendly for the community to do multispeaker, multilingual TTS research.

Oct 04 '22 13:10 ga642381
