pheme
pheme copied to clipboard
hallucination in the T2S stage.
Does the autoregressive decoding of the T2S stage induce random hallucinated results such as repeated words/phrases or long silences? how does this relate to the reported WER results?