Rishikesh (ऋषिकेश)
Rishikesh (ऋषिकेश)
Ok thanks, just a thing, the hyperparameter of VAE remain same for Tacotron 2 as of this repo or some thing needed to tweak for better performance? And roughly how...
`util` and `datasets` folders are not uploaded, I have copied from keithito's implementation. But still get : ``` from util.util import shape_list, vae_weight ImportError: No module named 'util.util' ``` may...
Same here I have trained model till 200K and when I passed referenced audio , it distort everything and generate some different voice. Check here is the pretrained models and...
@AndroXD @yanggeng1995 One observation, as I go through the samples uploaded by author of this paper here : http://home.ustc.edu.cn/~zyj008/ICASSP2019/ , I noticed that VAE tacotron distort and changes the original...
I tried it but still not able to get good speed on `baseline` model.
your audio should be single channel, check weather you pass single or dual channel.
don't use that branch, that branch no more maintained.
@QuantScientist this https://github.com/rishikksh20/FastSpeech2
some files are smaller than segment size of 8192 data points, remove the files which are smaller than 1 sec.
@Gorodecki I don't provide any pretain model, you can train this model from scratch