WaveGrad icon indicating copy to clipboard operation
WaveGrad copied to clipboard

Poor Synthesis Quality on 44k Sample Rate

Open Malik7115 opened this issue 2 years ago • 1 comments

Hello, I have two somewhat identical datasets with similar samples. I have trained the WaveGrad with 22k sample rate audios and it is quite good. However, the synthesis quality for 44k sample rate data is not as good. Would really appreciate any suggestions, especially in terms of changing model parameters. The only changes in parameters are as follows:

sample rate: 44k n_fft: 2048 window_size: 2048 ho_length: 512

Malik7115 avatar Apr 11 '22 08:04 Malik7115

@Malik7115 Hey, I can suggest you to take a look on the Large WaveGrad model in the orig WaveGrad paper (not implemented here, but I think it is easy to adjust the current code). It definitely would be slower a bit, but at the same time should obtain better quality.

ivanvovk avatar Apr 12 '22 10:04 ivanvovk