FG-transformer-TTS
About the sample rate used in preprocessing
I found that TransformerTTS is trained and evaluated on audio with a sample rate of 22050 Hz, while the GST and LST modules take 16000 Hz audio as input, as wav2vec 2.0 requires. Why the difference? Would results be better if both used the same sample rate? I see no explanation in the paper.
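To make the mismatch concrete, here is a minimal sketch of what converting between the two rates involves. It is not taken from the repository: `resample_ratio` and `resample_linear` are hypothetical helpers written for illustration, and the linear interpolation has no anti-aliasing filter, so a real pipeline would presumably use a band-limited resampler from an audio library instead.

```python
from fractions import Fraction

TTS_RATE = 22050  # sample rate of the TransformerTTS audio (from the question)
W2V_RATE = 16000  # sample rate of the wav2vec 2.0 input (from the question)


def resample_ratio(src_rate, dst_rate):
    """Return the reduced (up, down) factors for converting src_rate to dst_rate."""
    ratio = Fraction(dst_rate, src_rate)
    return ratio.numerator, ratio.denominator


def resample_linear(samples, src_rate, dst_rate):
    """Naive linear-interpolation resampler (illustrative only, no low-pass filter)."""
    if not samples:
        return []
    n_out = len(samples) * dst_rate // src_rate
    out = []
    for i in range(n_out):
        pos = i * src_rate / dst_rate        # fractional position in the source signal
        lo = int(pos)                        # sample just before that position
        hi = min(lo + 1, len(samples) - 1)   # sample just after, clamped at the end
        frac = pos - lo
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out


up, down = resample_ratio(TTS_RATE, W2V_RATE)   # 320 and 441
one_second = [0.0] * TTS_RATE                   # one second of silence at 22050 Hz
resampled = resample_linear(one_second, TTS_RATE, W2V_RATE)
print(up, down, len(resampled))
```

The two rates are related by the ratio 320/441, so the same utterance has a different number of samples in each branch, and one second of 22050 Hz audio becomes 16000 samples after conversion.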