kaldi-gstreamer-server
poor VAD performance
Dear Alumae,
I have changed the VAD threshold to 0.04 in decoder.py for the gst cutter. But some garbage is still being decoded for signal below this threshold (RMS amplitude of about 0.002) at the start of an utterance. Have you checked whether this VAD works properly? Or am I making a mistake?
Alternatively, can I get the response with some chunk identification (i.e., which packet produced which result)?
Best Regards
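One way to double-check the numbers in the question above is to measure the RMS amplitude of each audio chunk offline and compare it against the 0.04 threshold. The following is only a minimal sketch: it assumes 16-bit little-endian mono PCM, the function names `chunk_rms` and `label_chunks` are hypothetical, and it does not reproduce the internal computation of the GStreamer cutter element. The chunk index it reports is a local label, not something the server returns.

```python
import math
import struct


def chunk_rms(pcm_bytes):
    """RMS amplitude of 16-bit little-endian PCM, normalized to [0, 1]."""
    n = len(pcm_bytes) // 2  # two bytes per sample (assumed format)
    if n == 0:
        return 0.0
    samples = struct.unpack("<%dh" % n, pcm_bytes[:n * 2])
    mean_square = sum(s * s for s in samples) / float(n)
    return math.sqrt(mean_square) / 32768.0


def label_chunks(pcm_bytes, threshold=0.04, chunk_samples=1600):
    """Split audio into fixed-size chunks and flag those above the threshold.

    Returns a list of (chunk_index, rms, is_above_threshold) tuples.
    """
    step = chunk_samples * 2  # chunk size in bytes
    rows = []
    for index, start in enumerate(range(0, len(pcm_bytes), step)):
        rms = chunk_rms(pcm_bytes[start:start + step])
        rows.append((index, rms, rms > threshold))
    return rows
```

If the chunks at the start of the utterance are all flagged as below the threshold here but still produce decoded output, the issue is more likely in how the cutter is configured or wired into the pipeline than in the threshold value itself.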
Have you solved this problem? I also see this when the VAD threshold is set relatively low.
I have switched from the HMM-based acoustic model to a neural-network-based one, which also has a backtracing feature and turns the garbage decoding into reasonable utterances. So I didn't need to focus on VAD afterwards.