supervoice-vall-e-2 icon indicating copy to clipboard operation
supervoice-vall-e-2 copied to clipboard

Misspelling issues.

Open patriotyk opened this issue 5 months ago • 0 comments

I have tried your models(voicebox and this one) and vall-e-2 sounds more natural, but there is lot of misspellings in the generated speech. Is it because of dataset? Have you tried to train voicebox on the libriheavy?

patriotyk avatar Sep 03 '24 12:09 patriotyk