kaldi-offline-transcriber

Offline transcription system for Estonian using Kaldi

Results 14 kaldi-offline-transcriber issues

Hey, not sure if `dockerfile` is up to date, as I am getting the error below when trying to build it locally:

```
docker build . -t kaldi-et
```
...

https://github.com/alumae/kaldi-offline-transcriber/blob/caadc224137c0c86e71e6bb8f65ab2b8d718f09b/local/json2trs.py#L40

```
Traceback (most recent call last):
  File "./local/json2trs.py", line 64, in <module>
    print_sections(trans["sections"], trans["speakers"])
  File "./local/json2trs.py", line 40, in print_sections
    print('' % (speakers[turn["speaker"]]["transcriber_id"], turn["start"], turn["end"]))
KeyError: 'S0'
```

tdnn_7d_online_pruned_rescored_main_rnnlm_unk.punctuated.normalized.json ```...
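The `KeyError: 'S0'` in the traceback suggests that a turn refers to a speaker ID that is missing from the `speakers` mapping. Below is a minimal sketch of a tolerant lookup, assuming that data shape. The helper name `speaker_label` and the fallback to the raw speaker ID are hypothetical and are not taken from `json2trs.py`.

```python
def speaker_label(speakers, turn):
    """Return a label for the turn's speaker, tolerating unknown IDs.

    Assumes `speakers` maps speaker IDs to dicts that may hold a
    "transcriber_id" key, as the traceback suggests. Falling back to the
    raw ID is an illustrative choice, not the project's behaviour.
    """
    speaker_id = turn["speaker"]
    # dict.get avoids the KeyError when the ID was never registered.
    info = speakers.get(speaker_id, {})
    return info.get("transcriber_id", speaker_id)


# Hypothetical data: "S1" is registered, "S0" is not.
speakers = {"S1": {"transcriber_id": "spk1"}}
known = speaker_label(speakers, {"speaker": "S1", "start": 0.0, "end": 1.5})
unknown = speaker_label(speakers, {"speaker": "S0", "start": 1.5, "end": 3.0})
```

Whether an unregistered speaker should be silently relabelled or reported as an error depends on what the upstream diarization step is expected to produce.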

Newer distributions that ship sox 14.4.2 or later fail with the "-2" option. This replaces it with "-b 16", which works. See the ChangeLog: https://sourceforge.net/p/sox/code/ci/45b161d73ec087a8e003747b1aed07cd33589bca/tree/ChangeLog Did not test on distributions older than ubuntu...
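The substitution described here can be sketched as a small argument rewrite. The function name `modernize_sox_args` and the sample command line are hypothetical. Only the `-2` to `-b 16` replacement comes from the issue.

```python
def modernize_sox_args(args):
    """Replace the legacy "-2" sample-size flag with "-b 16".

    Only an exact "-2" argument is rewritten; everything else is
    passed through unchanged.
    """
    updated = []
    for arg in args:
        if arg == "-2":
            updated.extend(["-b", "16"])
        else:
            updated.append(arg)
    return updated


# Hypothetical command line, for illustration only.
legacy = ["sox", "in.wav", "-c", "1", "-2", "out.wav"]
modern = modernize_sox_args(legacy)
```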

I'm a Kaldi noob but interested in using your setup for English. I looked at your other project and the Kaldi discussion boards, and this model seems like a...

I cannot find the _parse_options.sh_ script used by the main script _speech2text.sh_.

Perhaps the README could clarify what the expected process output is when the speaker ID feature is enabled? What is supposed to look different in the text output compared to disabling...

bug

I updated my Kaldi copy to the current latest, 5.1.114, on that branch and everything seems to work well. @alumae have you tested on 5.2 already?

An 8-core machine could plow through diarization faster if it were parallelized. What's the biggest complexity stopping us from having it?

I created a Makefile.options file with the following: `KALDI_ROOT=/home/$USER/tools/kaldi` But when running the Makefile, the symlinks it creates, namely sid, steps and utils, are somehow broken. When I examined the symlinks, instead...
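A quick way to confirm which of the links are dangling is sketched below. The function name `find_broken_symlinks` is hypothetical, and the default link names are simply the three mentioned in the issue.

```python
import os


def find_broken_symlinks(directory, names=("sid", "steps", "utils")):
    """Return the names in `directory` that are symlinks to missing targets."""
    broken = []
    for name in names:
        path = os.path.join(directory, name)
        # islink() inspects the link itself; exists() follows it to the target.
        if os.path.islink(path) and not os.path.exists(path):
            broken.append(name)
    return broken
```

Calling `find_broken_symlinks(".")` from the project directory lists the links whose targets do not exist. One thing that may be worth checking, although the issue does not confirm it as the cause: GNU make reads `$USER` as the variable `$(U)` followed by the literal text `SER`, so a path written that way in a makefile may not expand to the home directory.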

Hi, I think you have the same section twice in your Makefile: `build/audio/segmented/%: build/diarization/%/show.seg` Also, I do not fully understand this line: `build/trans/%/nnet2_online_ivector_pruned/decode/log: build/fst/nnet2_online_ivector/final.mdl build/fst/nnet2_online_ivector/graph_prunedlm build/trans/%/spk2utt build/trans/%/mfcc` I think build/trans/%/spk2utt...