FastSpeech Have anyone tried using LSTM to replace FFT block?

Have anyone tried using LSTM to replace FFT block?

Open BuaaAlban opened this issue 4 years ago • 0 comments

I have trained [37800/192000] steps, and it seems won't converge to a good value, especially the duration loss, it doesn't change much.

Mel Loss: 2.8434, Mel PostNet Loss: 2.5580, Duration Loss: 2.3693;

Aug 24 '20 02:08 BuaaAlban