insanely-fast-whisper Better Diarization pipeline

For some audio files the diarization works, while for others it does not. If I run a failing audio file with transcription only (no diarization), it works perfectly. I have attached 2 screenshots showing the difference.

Audio file that works:

Audio file that does not work:

I believe the issue is in diarize.py. I found someone with the same error in another repo (https://github.com/huggingface/speechbox/issues/28), but their solution did not work for me. Someone did try to fix it, but in that other repo rather than this one. Here is his solution (https://github.com/huggingface/speechbox/pull/35).

Dec 14 '23 19:12 omarsiddiqi224

Hey @omarsiddiqi224 - I just released a patch, which should fix this. Can you please run the following:

pipx uninstall insanely-fast-whisper
pipx install insanely-fast-whisper==0.0.13 --force

Dec 14 '23 20:12 Vaibhavs10

Thanks a lot. I had the same problem and this seems to have fixed it.

Dec 14 '23 21:12 oliverwehrens

Thank you, yes it worked

Dec 14 '23 22:12 omarsiddiqi224

Be careful with this so-called fix. Since it is exactly the same code used in speechbox (I wonder why the speechbox library isn't integrated directly into this repo instead), the speakers assigned won't be the right ones, for the reasons I mentioned in the issue cited above.

This workaround just ignores an underlying problem, and for a long enough audio file the results will be totally wrong. But that's just my 2 cents.
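For anyone following along who hasn't looked at how speaker labels get attached to transcript chunks, here is a minimal sketch of a nearest-end-timestamp alignment. To be clear, this is not the code from diarize.py or speechbox; the function name, the dict layouts, and the early `break` are all assumptions made for illustration only.

```python
# Illustrative only: a simplified nearest-end-timestamp alignment.
# NOT the actual code from diarize.py or speechbox; all names are hypothetical.

def assign_speakers(speaker_segments, transcript_chunks):
    """Attach a speaker label to each transcript chunk.

    speaker_segments: list of {"speaker": str, "start": float, "end": float},
        sorted by time (assumed layout).
    transcript_chunks: list of {"text": str, "timestamp": (start, end)},
        sorted by time (assumed layout).
    """
    remaining = list(transcript_chunks)
    labelled = []
    for segment in speaker_segments:
        if not remaining:
            # No chunks left to label: later speaker segments are dropped.
            break
        # Pick the chunk whose end time is closest to this segment's end.
        ends = [chunk["timestamp"][1] for chunk in remaining]
        cut = min(range(len(ends)), key=lambda i: abs(ends[i] - segment["end"]))
        # Every chunk up to and including that one gets this speaker.
        for chunk in remaining[: cut + 1]:
            labelled.append({**chunk, "speaker": segment["speaker"]})
        remaining = remaining[cut + 1 :]
    return labelled
```

In a scheme like this, the closest chunk end is always picked, however far it is from the real speaker change, so a bad match early on shifts every later label. Whether that is what happens in this repo is something to verify against the actual source.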

Dec 15 '23 04:12 Pikauba

Hey hey @Pikauba - The reason I added it here is so that we can maintain it better, since Speechbox isn't actively maintained at the moment.

Also, note that we will be rolling out a new production-ready whisper package in January.

I'd be open to adding your fix here too, if you would like to raise a similar PR in this repo!

Thank you for your brilliant contribution! 🤗

Dec 16 '23 12:12 Vaibhavs10

Any word on when we can expect this new whisper package?

Jan 17 '24 10:01 FredHaa
