iSTFTNet-pytorch

How is the quality of this net?

Open hdmjdp opened this issue 3 years ago • 4 comments

Have you got good audio?

hdmjdp • Apr 26 '22 02:04

Yes, better than HiFi-GAN with less training.

rishikksh20 • Apr 26 '22 05:04

> yes better than hifi-gan with less training

But in my experiments, iSTFTNet produces more machine-like noise in the audio. Did you run into this too?

Also, during training, this message is printed: "max value is tensor(1.1451, device='cuda:0', grad_fn=<MaxBackward1>)".

hdmjdp • Apr 26 '22 06:04

> also in the training, there is "max value is tensor(1.1451, device='cuda:0', grad_fn=<MaxBackward1>) " note

No bounding activation is applied to spec, only an exp to make it positive, so spec has no upper bound. The generated audio therefore has no upper bound either, which would explain a max value above 1. Leaving the spec unbounded is not a good idea.

ease-zh • Dec 25 '23 03:12
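
The point in the last comment can be illustrated with a small sketch. This is not the repository's code. It is a simplified, pure-Python assumption of the situation: a single frame, a naive inverse DFT instead of a real iSTFT (no window, no overlap-add), zero phase in every bin, and hypothetical helper names `inverse_dft_frame` and `peak_amplitude`. It only shows that when the magnitude is `exp(x)` with nothing limiting `x`, the peak of the reconstructed waveform grows with `x` and can exceed 1.

```python
import cmath
import math


def inverse_dft_frame(magnitude, phase):
    """Naive inverse DFT of one frame built from per-bin magnitude and phase."""
    n = len(magnitude)
    # Combine magnitude and phase into complex spectrum bins.
    spectrum = [m * cmath.exp(1j * p) for m, p in zip(magnitude, phase)]
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)).real / n
        for t in range(n)
    ]


def peak_amplitude(pre_activation, n_bins=16):
    """Peak |sample| when every bin has magnitude exp(pre_activation) and zero phase."""
    # exp() keeps the magnitude positive but puts no ceiling on it.
    magnitude = [math.exp(pre_activation)] * n_bins
    phase = [0.0] * n_bins
    frame = inverse_dft_frame(magnitude, phase)
    return max(abs(sample) for sample in frame)


if __name__ == "__main__":
    # The peak follows exp(pre_activation), so it passes 1.0 once pre_activation > 0.
    for pre_activation in (-1.0, 0.0, 0.5, 2.0):
        print(pre_activation, peak_amplitude(pre_activation))
```

In this toy setup the peak equals `exp(pre_activation)`, so any positive pre-activation pushes the waveform outside [-1, 1]. A real iSTFT with windowing and learned phase behaves differently in detail, but the magnitude is still unbounded from above.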