pytorch-kaldi icon indicating copy to clipboard operation
pytorch-kaldi copied to clipboard

Word transcription of TIMIT dataset

Open shessam opened this issue 4 years ago • 1 comments

How can word-level instead of phoneme-level speech recognition be done with the TIMIT dataset? I build and train models. On the other hand, I have only phoneme transcription. I want word transcription of audio files. Would you help me?

shessam avatar Nov 29 '20 20:11 shessam

Hi, this should certainly be managed at the Kaldi level as labels and features are generated with Kaldi !

TParcollet avatar Dec 31 '20 17:12 TParcollet