FG-transformer-TTS

About the sampling rate used in preprocessing

Open JohnHerry opened this issue 2 years ago • 0 comments

I found that TransformerTTS is trained and validated on audio with a sampling rate of 22050 Hz, while the GST and LST inputs use audio at 16000 Hz, as wav2vec 2.0 requires. Why the difference? Would it work better if both used the same sampling rate? I see no explanation in the paper.
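To make the question concrete, here is a minimal sketch of what bringing 22050 Hz audio down to 16000 Hz involves. It is not taken from this repository: the helper names `resample_ratio` and `resample_linear` are hypothetical, and plain linear interpolation is used only as a stand-in so the example needs nothing beyond the standard library. A real pipeline would presumably use a band-limited resampler with anti-alias filtering instead.

```python
import math
from fractions import Fraction


def resample_ratio(src_sr, dst_sr):
    """Return (up, down) so that dst_sr == src_sr * up / down in lowest terms."""
    ratio = Fraction(dst_sr, src_sr)
    return ratio.numerator, ratio.denominator


def resample_linear(samples, src_sr, dst_sr):
    """Resample by linear interpolation (illustrative only, no anti-alias filter)."""
    n = len(samples)
    if n == 0:
        return []
    # Number of output samples covering the same duration (integer ceiling).
    n_out = -(-n * dst_sr // src_sr)
    out = []
    for i in range(n_out):
        # Position of output sample i on the source sample grid.
        num = i * src_sr
        lo = num // dst_sr
        frac = (num % dst_sr) / dst_sr
        hi = min(lo + 1, n - 1)  # clamp at the last source sample
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out


# One second of a 440 Hz tone at the TTS rate, converted to the wav2vec 2.0 rate.
tone = [math.sin(2 * math.pi * 440 * t / 22050) for t in range(22050)]
tone_16k = resample_linear(tone, 22050, 16000)
print(resample_ratio(22050, 16000), len(tone), len(tone_16k))
```

The ratio reduces to 320/441, so one second of audio goes from 22050 samples to 16000 samples. The two branches therefore see the same utterance at different time resolutions unless one of them is resampled.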

JohnHerry, Mar 22 '22 03:03