Nickolay V. Shmyrev
Nickolay V. Shmyrev
Fixes issue #40, issue #41, issue #36
It is not documented or enforced in the API but actually cuda online pipeline requires audio chunk to have exactly GetNSampsPerChunk of audio data. There is no buffering in cuda...
While running Voxceleb with different architectures I noticed that xvector extraction is very slow: ``` nnet3-xvector-compute --verbose=0 --use-gpu=no --min-chunk-size=25 --chunk-size=10000 \ --cache-capacity=64 "nnet3-copy \ --nnet-config=exp/xvector_nnet_1a/extract.config \ exp/xvector_nnet_1a/final.raw - |" "ark:apply-cmvn-sliding...
For more details on Vosk see https://github.com/alphacep/vosk-server https://github.com/alphacep/vosk-api
Hi Thank you for the nice software. Could you please share the information: 1) How long the training on librispeech takes on how many GPUs 2) How fast is the...
Should be lowercase actually
0.3.42 works, 0.3.43 doesn't 
There are projects like https://github.com/Babilinski/vosk-stt-unity and https://github.com/Yeti47/Vosk4Unity Eventually we need to have our own project for that.
https://github.com/wzhd/vosk-rs and https://github.com/shakacode/vosk-rs
As in https://github.com/daanzu/kaldi-active-grammar/issues/33 https://github.com/gooofy/zamia-speech/issues/106