inaSpeechSegmenter VAD to detect simultaneous music and voice

VAD to detect simultaneous music and voice

Open realies opened this issue 5 years ago • 3 comments

trafficstars

This is more of a feature request - is it possible to detect simultaneous music and voice?

Sep 27 '20 19:09 realies

Right now, this would require to design new voice activity detection systems within inaspeechsegmenter. Are you aware of corpora allowing to design and evaluate such systems ?

Feb 16 '21 16:02 DavidDoukhan

Not really. I presumed the preexisting functionality and datasets can be changed to distinguish between music and music with narration over it, based on some confidence ratio. Your comment makes it sound like to achieve this, the project needs a completely different VAD system?

Apr 27 '21 19:04 realies

@DavidDoukhan, could existing corpora be used to mix music and voice with various ratios and extend the training dataset in a new VAD mode?

Nov 22 '21 09:11 realies

@DavidDoukhan, is this really completed?

Sep 05 '23 13:09 realies

This won't be done.

Sep 05 '23 13:09 DavidDoukhan

inaSpeechSegmenter inaSpeechSegmenter copied to clipboard

VAD to detect simultaneous music and voice

inaSpeechSegmenter
inaSpeechSegmenter copied to clipboard