Add Speaker diarization model

Open adelavega opened this issue 2 years ago • 5 comments

SpeechBrain looks promising for speaker recognition and diarization, among other speech-related features.

adelavega avatar Mar 01 '23 19:03 adelavega

https://speechbrain.readthedocs.io/en/latest/index.html

adelavega avatar Mar 01 '23 20:03 adelavega

A potential option: https://ufarooqi.com/speaker-diarization-for-whisper-transcripts/

adelavega avatar Mar 01 '23 20:03 adelavega

Looks like speaker diarization is not great yet, especially with an unknown number of speakers.

adelavega avatar Mar 01 '23 20:03 adelavega

I can attest to the quality of Rev.ai speaker diarization, though at the moment it only comes as a package with transcription jobs. 😄

For a free/open-source option, I've also seen some decent results with https://github.com/pyannote/pyannote-audio compared to SpeechBrain.

qmac avatar Mar 01 '23 20:03 qmac

Thanks! Actually for our purposes I really wouldn't mind just paying for Rev on occasion. Relatively small amount of data.

adelavega avatar Mar 01 '23 20:03 adelavega