StyleSpeech icon indicating copy to clipboard operation
StyleSpeech copied to clipboard

the audio quality is not good by using HiFi-GAN

Open 443127316 opened this issue 2 years ago • 1 comments

Same as the title, I use the HiFi-GAN vocoder to generate the audio. But there is full of noice in the audio. How could you make the qualified audio as the demo page. Could you pls share some experinece.

Thanks a lot.

443127316 avatar Dec 14 '21 13:12 443127316

Following the official code of HiFi-GAN, they inference audio by audio = audio * MAX_WAV_VALUE audio = audio.cpu().numpy().astype('int16') write(output_file, h.sampling_rate, audio)

In my case, it works when I changed the code a little as follow: audio = audio.cpu().float().numpy() write(output_file, h.sampling_rate, audio)

hcy71o avatar May 10 '22 11:05 hcy71o