Voice Activity Detection

Open • unhammer opened this issue 5 years ago • 1 comment

I didn't see anything on VAD, so maybe that should be a new category? I don't know enough about it to say whether it could be considered a language-independent task, or what the current state of the art is (which is why I'm opening this issue ;-))

It does seem like webrtc-vad is used a lot, so that might be the de facto baseline, while https://ieeexplore.ieee.org/document/8309294 / https://github.com/jtkim-kaist/VAD seems like a contender for state of the art (and has a freely available dataset).

unhammer, Apr 26 '19 07:04

Thanks for the mention. We could potentially add this to a speech-related section if there's interest.

sebastianruder, Apr 29 '19 21:04