StyleSpeech
StyleSpeech copied to clipboard
the audio quality is not good by using HiFi-GAN
Same as the title, I use the HiFi-GAN vocoder to generate the audio. But there is full of noice in the audio. How could you make the qualified audio as the demo page. Could you pls share some experinece.
Thanks a lot.
Following the official code of HiFi-GAN, they inference audio by audio = audio * MAX_WAV_VALUE audio = audio.cpu().numpy().astype('int16') write(output_file, h.sampling_rate, audio)
In my case, it works when I changed the code a little as follow: audio = audio.cpu().float().numpy() write(output_file, h.sampling_rate, audio)