vits2 icon indicating copy to clipboard operation
vits2 copied to clipboard

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

Results 15 vits2 issues
Sort by recently updated
recently updated
newest added

Hi, I found the implementation of ResidualCouplingLayer.forward(normalize_flow.py) is different from official VITS code, and this section is not described in the VITS2 paper. what principle your implementation is based on?...

What version of the following module is? Please give us the detailed version. torch torchvision torchaudio torchtext I have the following problem: /python3.11/site-packages/torchtext/lib/libtorchtext.so: undefined symbol: _ZN5torch3jit21setUTF8DecodingIgnoreEb And what's your cuda...

I've gotten to step 4 of making a custom dataset (skipping the LJ Speech and VCTK steps) and I've stumbled across a `ModuleNotFoundError`. I'm not too sure how this is...

Hey there I trained a model to 42,000 steps on master. And, it sounds like the voice that I trained it on but.. the phonemes sound like eSpeak-EN-US. Just wondering...

I have a concern about whether the random variable z_q_dur should follow a normal distribution or not. When assuming z_q_dur follows a normal distribution, of course z_audio will follow a...

can here: [Bert-VITS2-Faste](https://github.com/foocker/Bert-VITS2-Faster)

Hi, I was wondering if there is anyone who is currently working on a german version of VITS2 :)

question

n_speakers > 0 has problem

help wanted