vits2_pytorch icon indicating copy to clipboard operation
vits2_pytorch copied to clipboard

Why are you using "use_mel_posterior_encoder"?

Open Moon-sung-woo opened this issue 9 months ago • 2 comments

Hi I'm sungwoo Moon. First of all, thank you for your sharing your code.

I'm looking at your code and I'm wondering why you use 'use_mel_posterior_encoder'. In the paper vits1, it says that we use spectrogram like vits1, but I wonder if there is a difference in TTS performance.

Thank you.

Moon-sung-woo avatar May 17 '24 02:05 Moon-sung-woo