Nikita Grebenyuk comments

Results 47 comments of


                                            Nikita Grebenyuk

Can I use it without GPU?

Yes, you can use it on CPU (RAM), just add .cpu() command to some lines in script (which will give your error).

The config of hifigan used when generate samples

> Thanks, will upload the pre-trained hifi-gan model as well as the configuration file soon. Could you share link?

Is punctuation an essential part of input when training TTS model?

VITS authors don't answer here. Probably you should test it yourself, but I think if you use your own letter set and some letters can replace comma, dot etc, why...

Problems with the pronunciation of one word.

Try to add short phrases into your dataset. If it's trained to say some phonemes only in connection with other, it can't do single word well.

Problems with the pronunciation of one word.

> If it is caused by data-hunger, then how much data needed for each speaker if I make a multi-speaker instance? About 2 hours is minimum for good result.

How to finetune the given pre-trained model?

No discriminator model, so no way for fine-tuning.

[Question] How many iterations for the available pretrained model?

"All models in the ablation study were trained up to 300k steps" From paper.

[Question] How many iterations for the available pretrained model?

> So the available checkpoint in the GitHub repository is also trained up to 300k steps? Must be so. Train dataset is 12.5k, so it's ~1500 iterations over the whole...

Not all .wav files generate corresponding spec.pt

Too big wav files are omitted: https://github.com/jaywalnut310/vits/blob/main/data_utils.py#L302 You can try to change it from here (increase numbers in boundaries, for example `[32,300,450,600,750,1000,1200,1400,1600]`): https://github.com/jaywalnut310/vits/blob/main/train.py#L70 (but note that too big files may...

Not all .wav files generate corresponding spec.pt

Or nicer way - move it in config and setup as you want: https://github.com/jaywalnut310/vits/pull/119/commits/490b60abe3978650d4341c0c64dc49ab76287d58