nsynth_wavenet The waveform generated by parallel wavenet looks like the result of the teacher, but quiet noisy.

The waveform generated by parallel wavenet looks like the result of the teacher, but quiet noisy.

Open bfs18 opened this issue 6 years ago • 9 comments

teacher waveform te_waveform student waveform st_waveform teacher spectrogram te_spec student spectrogram st_spec

The student is run with weight normalization and the default configurations in the parallel_wavenet.json. The wave generated by the student is still quiet noisy. I am testing data dependent initialization fro weight normalization.

Jun 18 '18 10:06 bfs18

Nice. How is the f0 looks like?

Jun 18 '18 10:06 zhang-jian

hi @bfs18 ! Nice. How many steps have been trained to get that results?

Jun 25 '18 09:06 maozhiqiang

@maozhiqiang The above result is evaluated at 50k steps. I also generate waves at 150k steps. It is a bit clearer, but still noisy.

Jun 25 '18 12:06 bfs18

@bfs18 how about the teacher network's performance? the good teacher network is very very important!

Jun 26 '18 00:06 maozhiqiang

https://github.com/bfs18/nsynth_wavenet/blob/data_dep_init/tests/pred_data-pwn-failed_cases/gen_LJ001-0001-stft_abs.wav I got a bit clearer waveform. @zhang-jian @maozhiqiang

Jun 30 '18 17:06 bfs18

Just wondering if you have tried to train the student model on KL loss only?

Jul 03 '18 10:07 zhang-jian

@zhang-jian I tried that, but didn't get meaningful result. Experimenting on KL + power loss is more promising. Besides I have limited computing resource. I didn't spent much time on it.

Jul 03 '18 12:07 bfs18

Hi @bfs18 , How many iterations does the teacher network take to get a result like tests/pred_data-use_mu_law+mol/gen_LJ001-0001.wav ?

Aug 02 '18 10:08 HallidayReadyOne

HI @HallidayReadyOne 200K steps.

Aug 02 '18 21:08 bfs18

nsynth_wavenet nsynth_wavenet copied to clipboard

The waveform generated by parallel wavenet looks like the result of the teacher, but quiet noisy.

nsynth_wavenet
nsynth_wavenet copied to clipboard