self-supervised-speech-recognition
self-supervised-speech-recognition copied to clipboard
stt
I want to know what is the tgt_dict in the stt file, in the process_predictions method to use tgt_words and hypo_words to calculate the edit distance, there is no label input when decoding ah, what is this tgt_words