icefall How to test audio files with model trained on Gigaspeech

How to test audio files with model trained on Gigaspeech

Open WangHelin1997 opened this issue 3 years ago • 1 comments

trafficstars

Hi authors,

Do you have any suggestions about how to test audio files with model trained on Gigaspeech (you provided)? Thanks so much.

Nov 02 '22 20:11 WangHelin1997

There are multiple ways. You can choose either of them.

Go to https://huggingface.co/spaces/k2-fsa/automatic-speech-recognition this approach is the simplest one. You only need a browser.
Use pretrained.py Unfortunately, there is no pretrained.py in gigapseech folder. Normally, there will be a pretraniend.py in each folder, e.g., librispeech, that you can use to decode waves with a pretrained model.
Use https://github.com/k2-fsa/sherpa/blob/master/sherpa/bin/pruned_transducer_statelessX/offline_asr.py it also takes a pretrained model as input to decode wave files. You can find detailed usages inside the file.
There are also c++ binaries from https://github.com/k2-fsa/sherpa that you can use. Please read the doc of sherpa for more details.

Nov 03 '22 00:11 csukuangfj