vits2 issues

2

Hi, I found the implementation of ResidualCouplingLayer.forward(normalize_flow.py) is different from official VITS code, and this section is not described in the VITS2 paper. what principle your implementation is based on?...

nzpeng

What version of torch？

2

What version of the following module is? Please give us the detailed version. torch torchvision torchaudio torchtext I have the following problem： /python3.11/site-packages/torchtext/lib/libtorchtext.so: undefined symbol: _ZN5torch3jit21setUTF8DecodingIgnoreEb And what's your cuda...

aijianiula0601

ModuleNotFoundError | Step 4 of Custom Dataset

5

I've gotten to step 4 of making a custom dataset (skipping the LJ Speech and VCTK steps) and I've stumbled across a `ModuleNotFoundError`. I'm not too sure how this is...

641i130

paperr0se

question

n_speakers > 0 有问题啊兄弟

3

n_speakers > 0 has problem

wlz987

help wanted

vits2
vits2 copied to clipboard

Metadata

Can I copy vocab.txt from ljs_base to vctk? How to preprocess text in vctk?

How much computing resources are needed for training?

normalize_flow 和官方VITS代码不一样的实现方式

What version of torch？

ModuleNotFoundError | Step 4 of Custom Dataset

espeak phoneme tokenization - failed experiment?

Should z_q_dur drawn from Gaussian distribution?

faster?

Anyone working on a German vits2?

n_speakers > 0 有问题啊兄弟

← Metadata

Owner

Metadata

vits2 vits2 copied to clipboard

Metadata

← Metadata

Owner

Metadata

vits2
vits2 copied to clipboard