이준혁(Junhyeok Lee) comments

Results 37 comments of


                                            이준혁(Junhyeok Lee)

How do I upsample a wav once the model is trained?

Hi! Thank you for your interest! We already wrote code about wav file upsampling (inference) with trained model, but we did not write it on README. https://github.com/mindslab-ai/nuwave/blob/f941928c12db6d0e325e8c1997079d274ea4cede/sampling.py Maybe we could...

Different sampling rates

Hi! Thank you for the great question! We only tested for x2 and x3, but I think this work also available to apply on non-integer ratio upsampling. You only need...

Different sampling rates

Thank you for trying the additional experiment! Plz let me know if you need some help!

Different sampling rates

Interesting works! I expected that you need to train more. Since denoising score matching process is trained by random Gaussian noise and complicated, we also got similar problems. We trained...

Different sampling rates

In addition, if you have an STT model you could apply conditional score generation suggested by Yang Song (https://arxiv.org/pdf/2011.13456.pdf Section 5 and Appendix I).

Different sampling rates

Hmm, I think you need to modify the inference schedule instead of the training schedule. Since 8 iteration's value is fit to our setup, maybe it could not be optimal...

Different sampling rates

(Now I can see it) In my opinion, neural upsampling and accuracy of AST/STT could be irrelevant OR your dataset is small to train general model. If problem is a...

Different sampling rates

In addition, I am curious about your hyperparameters. Please let me know your batch or audio length or any difference between our [hparameter.yaml](https://github.com/mindslab-ai/nuwave/blob/master/hparameter.yaml) file.

Different sampling rates

Hello Viet Anh! As already mentioned, since the diffusion model is complicated, it needs a lot of time for training. Our results for targeting 48k was trained over 240k epochs...

Different sampling rates

Oh sorry for the misinformation. Yeah I mean 240k iterations instead of epoch.