vosk-android-demo icon indicating copy to clipboard operation
vosk-android-demo copied to clipboard

Voice separation on multiple inputs via microphone

Open MuhammadRashid opened this issue 4 years ago • 3 comments

Hi Nickolay, Here is a scenario, suppose an audio is playing in a user's Andorid device and meanwhile an announcement occurs outside, now microphone is capturing multiple inputs. How can we recognise and separate these two sources (audio playing in a user's device and announcement outside)?

MuhammadRashid avatar Nov 13 '19 09:11 MuhammadRashid

I didn't investigate this deeply, it should be possible to record without capturing playback, but no idea how to do it properly.

nshmyrev avatar Nov 13 '19 23:11 nshmyrev

Hi Nickolay, The model directory contains HCLr.fst and Gr.fst, so can you please show me how to produce them? I found a script to produce HCL.fst and G.fst at https://github.com/opendcd/opendcd/blob/master/script/makeclevel.sh. Can i use it? Thanks!

huynguyen82 avatar Dec 05 '19 07:12 huynguyen82

@huynguyen82 please create a separate issue, do not post into other unrelated issue. Graph is created with mkgraph_lookahead_arpa.sh, see https://github.com/kaldi-asr/kaldi/pull/3616

nshmyrev avatar Dec 05 '19 08:12 nshmyrev