kaldi-gstreamer-server

Poor VAD performance

Open Umar17 opened this issue 6 years ago • 2 comments

Dear Alumae,

I have changed the VAD threshold to 0.04 in decoder.py for the gst cutter element. However, some garbage is still being decoded at the start of the utterance for signal below this threshold (RMS amplitude of about 0.002). Have you checked whether this VAD works properly? Or am I making a mistake?
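To check whether the leading audio really is below the threshold, the RMS amplitude can be measured outside the pipeline. The following is only a minimal sketch: it assumes 16-bit little-endian mono PCM and a 0.0 to 1.0 amplitude scale, the function names are hypothetical, and it does not reproduce how the cutter element computes its level internally.

```python
import math
import struct


def rms_amplitude(pcm_bytes: bytes) -> float:
    """RMS of 16-bit little-endian mono PCM, normalized to roughly 0.0..1.0.

    Assumption: the audio format is S16LE mono; adjust for other formats.
    """
    n = len(pcm_bytes) // 2
    if n == 0:
        return 0.0
    samples = struct.unpack("<%dh" % n, pcm_bytes[: n * 2])
    mean_square = sum(s * s for s in samples) / n
    return math.sqrt(mean_square) / 32768.0


def is_above_threshold(pcm_bytes: bytes, threshold: float = 0.04) -> bool:
    """Hypothetical helper: compare a chunk's RMS against the VAD threshold."""
    return rms_amplitude(pcm_bytes) >= threshold


def sine_chunk(peak: float, n: int = 1600, freq: float = 440.0, rate: int = 16000) -> bytes:
    """Synthetic test chunk: a sine wave with the given peak amplitude (0..1)."""
    values = [int(peak * 32767 * math.sin(2 * math.pi * freq * i / rate)) for i in range(n)]
    return struct.pack("<%dh" % n, *values)


quiet = sine_chunk(0.003)  # RMS of about 0.002, like the signal described above
loud = sine_chunk(0.1)     # RMS well above 0.04

print(rms_amplitude(quiet), is_above_threshold(quiet))
print(rms_amplitude(loud), is_above_threshold(loud))
```

Running this over the first few chunks of a problematic recording would show whether their level is actually under 0.04, or whether the audio reaching the decoder is louder than expected.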

Alternatively, can I get the response with some chunk identification (i.e. which result came from which packet)?

Best Regards

Umar17, Aug 16 '18 12:08

Have you solved this problem? I also get this when the VAD threshold is set relatively low.

yangxueruivs, Dec 20 '18 03:12

I have switched from the HMM-based acoustic model to a neural-net-based one, which also has a back-tracing feature and turns the garbage decoding into a reasonable utterance. So I didn't need to focus on VAD afterwards.

Umar17, Dec 20 '18 10:12