TTS [Feature request] VITS 2 implementation

🚀 Feature Description

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

Transformer Blocks in flow mechanism
Text Encoder conditioned on speaker embeddings

I'd love you even more with these modifications! Coming soon?

Aug 01 '23 10:08 mehrzadai

@mehrzadai can you post a link to the paper?

Aug 01 '23 10:08 manmay-nakhashi

@manmay-nakhashi https://arxiv.org/abs/2307.16430 I expect the changes will address several issues, such as the strong length dependency and some of the latency problems.

Aug 01 '23 13:08 mehrzadai

I think @p0p4k is working on it.

Aug 04 '23 08:08 erogol

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look at our discussion channels.

Sep 07 '23 02:09 stale[bot]

Hi! Any updates on implementing VITS2?

Nov 11 '23 08:11 S-Ali-Zaidi

I think @p0p4k gave up on that :(

Nov 13 '23 13:11 erogol

@S-Ali-Zaidi I'll do it this month, sorry for the delay!

Nov 15 '23 15:11 p0p4k

OK, reopened then 👍

Nov 16 '23 10:11 erogol

Yay, glad to hear it! Thank you both!

Nov 25 '23 23:11 S-Ali-Zaidi

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look at our discussion channels.

Dec 26 '23 18:12 stale[bot]

Someone is supposedly still working on it

Dec 26 '23 18:12 andersmmg

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look at our discussion channels.

Jan 28 '24 22:01 stale[bot]
