vits2_pytorch
unofficial vits2-TTS implementation in pytorch
Hi, I'm Sungwoo Moon. First of all, thank you for sharing your code. I'm looking at your code and I'm wondering why you use 'use_mel_posterior_encoder'. In the paper vits1,...
  This is my log and tensorboard log.
Hello, thanks for all the effort to create this repo. When I launch training, it runs for a few steps and then I see no progress at all. Its...
Refer to https://github.com/p0p4k/vits2_pytorch/issues/84#issue-2212649665
https://github.com/p0p4k/vits2_pytorch/blob/1f4f3790568180f8dec4419d5cad5d0877b034bb/train_ms.py#L172-L188 Because of these lines, `net_dur_disc` remains `None`.
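A minimal sketch of the general pattern behind this report, not the repository's actual code: when an optional module is only constructed under a config flag, it stays `None` otherwise, and every later use needs a guard. The function names, the `use_duration_discriminator` key, and the stand-in "network" below are assumptions made for illustration only.

```python
from typing import Callable, Optional


def build_duration_discriminator(model_cfg: dict) -> Optional[Callable[[float], float]]:
    """Hypothetical factory: returns None unless the (assumed) config flag is set."""
    if model_cfg.get("use_duration_discriminator", False):
        # Stand-in for a real network; it only halves its input.
        return lambda duration: duration * 0.5
    return None


def duration_loss(net_dur_disc: Optional[Callable[[float], float]], duration: float) -> float:
    """Skip the duration-discriminator term when the module was never built."""
    if net_dur_disc is None:
        return 0.0
    return net_dur_disc(duration)


# Flag missing: the module stays None and the guarded loss contributes nothing.
disabled = build_duration_discriminator({})
# Flag set: the module exists and is actually called.
enabled = build_duration_discriminator({"use_duration_discriminator": True})
```

If the flag is never read, or is read under a different key than the one in the config file, the first branch is never taken and the module silently stays `None`, which is one way the behaviour reported here could arise.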
Is it possible to use AlignerNet (aligner.py in pflow-tts repo) instead of MAS in VITS2? What should be changed in the code? I am a bit confused on what the...
This is a follow-up to the previous discussion threads regarding the stochastic duration predictor in https://github.com/p0p4k/vits2_pytorch/issues/11 and https://github.com/p0p4k/vits2_pytorch/issues/68#issuecomment-1839917607, as well as with reference to [Bert-VITS2](https://github.com/fishaudio/Bert-VITS2): Regarding training using SDP,...
Creating this PR to debug adding AlignerNet - based on this discussion: https://github.com/p0p4k/vits2_pytorch/issues/81
Hi all, we are trying to decrease the model size. As this project is configured, the feature channels are 192 (inter_channels=192, hidden_channels=192). Has anyone tried lowering the channels, e.g. to 160?...
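As a rough back-of-the-envelope aid for this question, the sketch below estimates how the weight count changes when the channel width is reduced. It assumes layers whose weight count is proportional to input channels times output channels (as in a dense or convolutional layer with equal input and output width). It is not a measurement of this model, and the helper name is hypothetical.

```python
def relative_weight_count(new_channels: int, base_channels: int = 192) -> float:
    """Ratio of weights for a layer whose size scales with channels squared.

    Assumption: weight count is proportional to in_channels * out_channels,
    with both set to the same width. Biases, embeddings, and layers with
    fixed-size inputs or outputs are ignored.
    """
    return (new_channels / base_channels) ** 2


# Going from 192 to 160 channels: (160 / 192) ** 2 = 25 / 36, roughly 0.69.
ratio = relative_weight_count(160)
```

Under that assumption, such layers would shrink to about 69% of their original weight count. The real saving depends on how much of the model scales this way.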
Hello, when I train using a custom dataset, I encounter the following error with these parameters: "filter_length": 2048, "hop_length": 512, "win_length": 2048, File "/mnt/Linux_DATA/synthesis/model/vits2_pytorch/train_ms.py", line 441, in train_and_evaluate loss_mel...