voicefixer icon indicating copy to clipboard operation
voicefixer copied to clipboard

some clip in tail

Open deyituo opened this issue 2 years ago • 5 comments

As it needs to pad some zeros to fit the downsample ratio, I find that it may cause the mel to have large values in the last frames. But I cannot find the reason. And if I pad more self.downsample_ratio frames, it will be ok. emm...

code: https://github.com/haoheliu/voicefixer/blob/main/voicefixer/restorer/model_kqq_bn.py#L70

pad_len += self.downsample_ratio

deyituo avatar Dec 24 '21 12:12 deyituo

I also have this issues with some audios, but cannot seem to fix it. I added more padding but it did not help.

johannahom avatar Jun 06 '22 07:06 johannahom

any update on this?

sciai-ai avatar Aug 03 '22 09:08 sciai-ai

Sorry for the late update. I'm trying to look into it recently. Will inform this thread once it's fixed.

haoheliu avatar Aug 03 '22 10:08 haoheliu

Hey @haoheliu! How is it going? Do u need help with fix?

Afiyetolsun avatar Aug 30 '23 20:08 Afiyetolsun

Error still exist, i think length can be cutted to by %SampRate of restore_inmem fft

Afiyetolsun avatar Aug 30 '23 20:08 Afiyetolsun