Comprehensive-Transformer-TTS
Bug in the energy calculation in FastSpeechSTFT
I think there is a bug in audio/stft.py, line 252:
energy = np.sqrt(np.exp(mel) ** 2).sum(-1)
Since sqrt(x ** 2) is just abs(x), this code only sums the absolute values of np.exp(mel) over the last axis. What we expect is the square root of the summed squares, i.e. the sum should be taken before the sqrt.
The correct code should be:
energy = np.sqrt((np.exp(mel) ** 2).sum(-1))
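
To make the difference concrete, here is a minimal sketch comparing the two expressions on a small made-up array. The values and the array layout are assumptions chosen for illustration only, and the variable names `l1_energy` and `l2_energy` are hypothetical, not taken from the repository.

```python
import numpy as np

# Hypothetical log-scale input: 2 rows, 2 values each (layout is an assumption).
# exp(mel) is approximately [[3, 4], [6, 8]].
mel = np.log(np.array([[3.0, 4.0], [6.0, 8.0]]))

# Expression currently in the code: sqrt is applied element-wise before the sum,
# so sqrt(x ** 2) collapses to |x| and the result is a plain sum (L1 norm).
l1_energy = np.sqrt(np.exp(mel) ** 2).sum(-1)

# Proposed fix: sum the squares first, then take the square root (L2 norm).
l2_energy = np.sqrt((np.exp(mel) ** 2).sum(-1))

print(l1_energy)  # approximately [ 7. 14.]
print(l2_energy)  # approximately [ 5. 10.]
```

The proposed expression should give the same result as `np.linalg.norm(np.exp(mel), axis=-1)`, which may be a more readable way to write it.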