
[Fisher] Issue on semisup/run_100k.sh

Open JuanPZuluaga opened this issue 4 years ago • 3 comments

Hi,

I've come across two problems in the https://github.com/kaldi-asr/kaldi/blob/master/egs/fisher_english/s5/local/semisup/run_100k.sh recipe.

  1. On stage 7: when running local/fisher_train_lms_pocolm.sh I get an error because the number of n-grams in the LM built from this dataset (100k) is smaller than the target number of n-grams for pruning:
the num-ngrams(1544907) of input LM is less than the target-num-ngrams(5000000), can not do any pruning.
  2. On stage 10: the parameter --sup-lat-dir $exp_root/chain/tri4a_train_sup_unk_lats should be changed to --sup-lat-dir $exp_root/chain/tri4a_train_sup_sp_unk_lats, which uses the sp version instead.
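
To make the stage 7 condition concrete, here is a minimal sketch of the comparison implied by that message. The can_prune helper is hypothetical and is not part of Kaldi or pocolm; only the two numbers come from the message above.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of Kaldi or pocolm): report whether an LM
# with num_ngrams n-grams can be pruned down to target_num_ngrams.
can_prune() {
  local num_ngrams=$1 target_num_ngrams=$2
  if [ "$num_ngrams" -gt "$target_num_ngrams" ]; then
    echo yes   # the LM is larger than the target, so pruning can shrink it
  else
    echo no    # the LM is already at or below the target size
  fi
}

# Numbers taken from the stage 7 message.
can_prune 1544907 5000000   # prints "no"
```

With the 100k subset the LM has fewer n-grams than the pruning target, so there is nothing to prune.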

R, Juan Pablo

JuanPZuluaga avatar Nov 03 '21 07:11 JuanPZuluaga

This issue has been automatically marked as stale by a bot solely because it has not had recent activity. Please add any comment (simply 'ping' is enough) to prevent the issue from being closed for 60 more days if you believe it should be kept open.

stale[bot] avatar Jan 02 '22 14:01 stale[bot]

  1. Does the first message cause any issues? Isn't it just a warning?
  2. Do you want to send a PR for the second one?

Thanks!

kkm000 avatar Mar 31 '22 00:03 kkm000

This issue has been automatically marked as stale by a bot solely because it has not had recent activity. Please add any comment (simply 'ping' is enough) to prevent the issue from being closed for 60 more days if you believe it should be kept open.

stale[bot] avatar May 31 '22 05:05 stale[bot]