NLP-progress
Voice Activity Detection
I didn't see anything on VAD, so maybe that should be a new category? I don't know enough about it to say whether it could be considered a language-independent task, or what the current state of the art is (which is why I'm opening this issue ;-))
It does seem like webrtc-vad is used a lot, so that might be the de facto baseline, while https://ieeexplore.ieee.org/document/8309294 / https://github.com/jtkim-kaist/VAD seems like a contender for state of the art (it has a freely available dataset).
Thanks for the mention. We could potentially add this to a speech-related section if there's interest.