kaldi-gstreamer-server poor VAD performance

poor VAD performance

Open Umar17 opened this issue 6 years ago • 2 comments

Dear Alumae,

I have changed VAD threshold to 0.04 in decoder.py for gst cutter. But some garbage is still being decoded for signal below this threshold (RMS amplitude of about 0.002) in start of utterance. Have you checked whether this VAD works properly or not? Or Am I making some mistake?

Or can I get response with some chunk identification? (what was result of which packet?)

Best Regards

Aug 16 '18 12:08 Umar17

Have you solved this problem? I also got this when VAD set to relatively low.

Dec 20 '18 03:12 yangxueruivs

I have shifted HMM based acoustic model to Neural Net based which has back tracing feature as well and improves garbage decoding to a reasonable utterance. So I didn't need to focus on VAD afterwards.

Dec 20 '18 10:12 Umar17

kaldi-gstreamer-server kaldi-gstreamer-server copied to clipboard

poor VAD performance

kaldi-gstreamer-server
kaldi-gstreamer-server copied to clipboard