End-to-end-ASR-Pytorch

Encoding Target

Open: MishaimMalik opened this issue 5 years ago • 1 comment

In Step 0, you mention that we can use one of the following options: phoneme/char/subword/word. But when I choose "word" instead of "subword", the encoding doesn't recognize it. The error is at line 134 of preprocess_librispeech.py, in the function read_text().

Can we apply the same encoding used for "subword" to "word"?
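For reference, word-level encoding in general just maps each whitespace-separated word to an integer index. Below is a minimal sketch of that idea, not the repository's own code: the names `build_word_vocab` and `encode_words` and the `<pad>`/`<unk>` special tokens are assumptions made for illustration.

```python
from collections import Counter


def build_word_vocab(transcripts, specials=("<pad>", "<unk>")):
    """Build a word -> index mapping from an iterable of transcript strings.

    Hypothetical helper: the special tokens and their order are assumptions.
    """
    counts = Counter(word for line in transcripts for word in line.split())
    # Sort by descending frequency, then alphabetically, so indices are stable.
    words = sorted(counts, key=lambda w: (-counts[w], w))
    return {token: index for index, token in enumerate(list(specials) + words)}


def encode_words(line, vocab):
    """Encode one transcript as a list of indices; unseen words map to <unk>."""
    unk = vocab["<unk>"]
    return [vocab.get(word, unk) for word in line.split()]


vocab = build_word_vocab(["THE CAT SAT", "THE DOG"])
print(encode_words("THE DOG RAN", vocab))
```

Unlike subword encoding, this needs no merge table, but any word missing from the training transcripts falls back to `<unk>`.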

Also, for the subword option, the bpe.vocab file is missing (in the case of LibriSpeech). Do we have to generate it ourselves? If yes, how?
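To make clear what generating a subword vocabulary involves, here is a rough pure-Python sketch of learning byte-pair-encoding merges from a word list. It is only an illustration of the general BPE idea: the function name `learn_bpe` and the `</w>` end-of-word marker are assumptions, and the output is not in the bpe.vocab format the preprocessing script expects, which this thread does not describe.

```python
from collections import Counter


def learn_bpe(words, num_merges):
    """Learn up to num_merges BPE merge rules from an iterable of words.

    Hypothetical helper for illustration; returns a list of symbol pairs.
    """
    # Represent each word as a tuple of characters plus an end-of-word marker.
    vocab = Counter(tuple(word) + ("</w>",) for word in words)
    merges = []
    for _ in range(num_merges):
        # Count how often each adjacent symbol pair occurs.
        pairs = Counter()
        for symbols, freq in vocab.items():
            for left, right in zip(symbols, symbols[1:]):
                pairs[(left, right)] += freq
        if not pairs:
            break
        best = max(pairs, key=pairs.get)
        merges.append(best)
        # Rewrite every word, fusing each occurrence of the best pair.
        new_vocab = Counter()
        for symbols, freq in vocab.items():
            merged, i = [], 0
            while i < len(symbols):
                if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == best:
                    merged.append(symbols[i] + symbols[i + 1])
                    i += 2
                else:
                    merged.append(symbols[i])
                    i += 1
            new_vocab[tuple(merged)] += freq
        vocab = new_vocab
    return merges


print(learn_bpe(["low", "low", "lower"], 2))
```

In practice a dedicated subword tool would be run over the training transcripts instead of hand-rolled code like this.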

MishaimMalik, Mar 07 '19 05:03

I think you need to modify the read_text function. I modified some of the code to fit the format my data is in, and then it worked. For example, I don't have a .trans.txt file; every transcript exists as an individual file.
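The kind of change described here could look roughly like the sketch below. It is not the actual read_text() from preprocess_librispeech.py: the function name `read_transcripts` and the one-file-per-utterance layout (a `.txt` file named after the utterance ID) are assumptions. The only thing taken as given is the standard LibriSpeech `.trans.txt` layout, where each line is an utterance ID followed by the text.

```python
from pathlib import Path


def read_transcripts(root):
    """Collect {utterance_id: text} from either transcript layout under root.

    Hypothetical helper; adapt the file patterns to how your data is stored.
    """
    root = Path(root)
    transcripts = {}

    # Layout 1: LibriSpeech-style *.trans.txt, one "UTT_ID TEXT" pair per line.
    for path in sorted(root.rglob("*.trans.txt")):
        for line in path.read_text(encoding="utf-8").splitlines():
            line = line.strip()
            if not line:
                continue
            utt_id, _, text = line.partition(" ")
            transcripts[utt_id] = text

    # Layout 2 (assumed): one .txt file per utterance, named after its ID.
    for path in sorted(root.rglob("*.txt")):
        if path.name.endswith(".trans.txt"):
            continue
        transcripts[path.stem] = path.read_text(encoding="utf-8").strip()

    return transcripts
```

Calling `read_transcripts("path/to/data")` returns a dictionary that can then be passed to whichever encoder (char, subword, or word) is in use.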

jybaek, May 09 '19 09:05