Mehrzad
Hi, I'm trying to train the base VITS case on a low-resource language. We have prepared 27K data close to the LJ settings, but during training the KL loss diverges to infinity...
**🚀 Feature Description** VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design + Transformer Blocks in flow mechanism + Text Encoder conditioned on speaker embeddings...
I ran into an issue that may be solved in the future, or that may already have a solution I don't know about. I have a scenario in which I have different categories...