TTS
TTS copied to clipboard
[Feature request] VITS 2 implementation
π Feature Description
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
- Transformer Blocks in flow mechanism
- Text Encoder conditioned on speaker embeddings
I even love you more with these modifications ! Soon ?
@mehrzadai can you post a paper link ?
@manmay-nakhashi https://arxiv.org/abs/2307.16430 I guess the changes will solve many things like high length dependency and some latency issues.
I think @p0p4k is working on it.
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look our discussion channels.
Hi! any updates on implementing VITS2?
I think @p0p4k gave up on that :(
@S-Ali-Zaidi Ill do it in this month, sorry for delay!
ok then reopened π
Yay, glad to hear it! Thank you both!
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look our discussion channels.
Someone is supposedly still working on it
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look our discussion channels.