fairseq
fairseq copied to clipboard
Evaluating TTS using STT
I tried to get a text back from the speech I generated using this model, but the text was a bit of. So maybe having some type of test could help while improving the model.
Basically, "some text" -> speech output -> speech input -> text output -> check if text output is "some text"