Rishikesh (ऋषिकेश) comments

Results 162 comments of


                                            Rishikesh (ऋषिकेश)

trafficstars

Does VAE model supports Tacotron 2 ?

Ok thanks, just a thing, the hyperparameter of VAE remain same for Tacotron 2 as of this repo or some thing needed to tweak for better performance? And roughly how...

Does VAE model supports Tacotron 2 ?

`util` and `datasets` folders are not uploaded, I have copied from keithito's implementation. But still get : ``` from util.util import shape_list, vae_weight ImportError: No module named 'util.util' ``` may...

My results are not as good as the examples

Same here I have trained model till 200K and when I passed referenced audio , it distort everything and generate some different voice. Check here is the pretrained models and...

My results are not as good as the examples

@AndroXD @yanggeng1995 One observation, as I go through the samples uploaded by author of this paper here : http://home.ustc.edu.cn/~zyj008/ICASSP2019/ , I noticed that VAE tacotron distort and changes the original...

[Proposal] Reduce training time by resampling beforehand

I tried it but still not able to get good speed on `baseline` model.

Assertion error: torchaaudio resample_waveform related

your audio should be single channel, check weather you pass single or dual channel.

Assertion error: torchaaudio resample_waveform related

don't use that branch, that branch no more maintained.

What did you use to generate the mel files from text?

@QuantScientist this https://github.com/rishikksh20/FastSpeech2

Error with fine_tuning as True with mel generated from Glow-TTS

some files are smaller than segment size of 8192 data points, remove the files which are smaller than 1 sec.

Pretrained checkpoint

@Gorodecki I don't provide any pretain model, you can train this model from scratch