inaSpeechSegmenter icon indicating copy to clipboard operation
inaSpeechSegmenter copied to clipboard

VAD to detect simultaneous music and voice

Open realies opened this issue 5 years ago • 3 comments
trafficstars

This is more of a feature request - is it possible to detect simultaneous music and voice?

realies avatar Sep 27 '20 19:09 realies

Right now, this would require to design new voice activity detection systems within inaspeechsegmenter. Are you aware of corpora allowing to design and evaluate such systems ?

DavidDoukhan avatar Feb 16 '21 16:02 DavidDoukhan

Not really. I presumed the preexisting functionality and datasets can be changed to distinguish between music and music with narration over it, based on some confidence ratio. Your comment makes it sound like to achieve this, the project needs a completely different VAD system?

realies avatar Apr 27 '21 19:04 realies

@DavidDoukhan, could existing corpora be used to mix music and voice with various ratios and extend the training dataset in a new VAD mode?

realies avatar Nov 22 '21 09:11 realies

@DavidDoukhan, is this really completed?

realies avatar Sep 05 '23 13:09 realies

This won't be done.

DavidDoukhan avatar Sep 05 '23 13:09 DavidDoukhan