Matcha-TTS Matcha compared to Vits

Matcha compared to Vits

Open yygg678 opened this issue 5 months ago • 2 comments

I replicated the results of VITS and Matcha-TTS on a single speaker Chinese dataset and found that the timbre similarity of Matcha-TTS is lower than that of VITS, especially in the high-frequency details of the spectrum. Below are the spectrograms of VITS and Matcha-TTS. Is there any way to improve the timbre similarity of Matcha-TTS?

Sep 22 '24 05:09 yygg678

Matcha-TTS Matcha-TTS copied to clipboard

Matcha compared to Vits

Matcha-TTS
Matcha-TTS copied to clipboard