pliers
Add Speaker diarization model
SpeechBrain looks promising for speaker recognition and diarization, among other speech-related features:
https://speechbrain.readthedocs.io/en/latest/index.html
A potential approach: https://ufarooqi.com/speaker-diarization-for-whisper-transcripts/
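For reference, here is a rough sketch of the general idea of attaching speaker labels to transcript segments. It is not taken from the linked post. It assumes we already have timestamped transcript segments (e.g. from Whisper) and speaker turns from some diarization model. `Segment`, `Turn`, and `assign_speakers` are hypothetical names made up for illustration, and each segment simply gets the speaker whose turns overlap it the most.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """A transcript segment with start/end times in seconds (hypothetical type)."""
    start: float
    end: float
    text: str


@dataclass
class Turn:
    """A diarization turn: one speaker talking from start to end (hypothetical type)."""
    start: float
    end: float
    speaker: str


def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the intersection of two time intervals (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(segments, turns, unknown="UNKNOWN"):
    """Label each segment with the speaker whose turns overlap it the most."""
    labeled = []
    for seg in segments:
        totals = {}
        for turn in turns:
            shared = overlap(seg.start, seg.end, turn.start, turn.end)
            if shared > 0:
                totals[turn.speaker] = totals.get(turn.speaker, 0.0) + shared
        # Fall back to a placeholder label when no turn overlaps the segment.
        speaker = max(totals, key=totals.get) if totals else unknown
        labeled.append((speaker, seg.text))
    return labeled


if __name__ == "__main__":
    segments = [Segment(0.0, 2.0, "Hi there."), Segment(2.0, 5.0, "Hello!")]
    turns = [Turn(0.0, 2.5, "SPEAKER_00"), Turn(2.5, 6.0, "SPEAKER_01")]
    for speaker, text in assign_speakers(segments, turns):
        print(f"{speaker}: {text}")
```

Majority overlap is the simplest choice here. Segments that straddle a speaker change get a single label, so splitting at word-level timestamps would probably work better if those are available.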
Looks like speaker diarization is not great yet, especially with an unknown number of speakers.
I can attest to the quality of Rev.ai speaker diarization, though at the moment it only comes as a package with transcription jobs. 😄
For free/open source, I've also seen some decent results with https://github.com/pyannote/pyannote-audio compared to SpeechBrain.
Thanks! Actually for our purposes I really wouldn't mind just paying for Rev on occasion. Relatively small amount of data.