whisper-streamlit

For better transcription in more languages, implement Massively Multilingual Speech (MMS), Meta's open-source model with less than half of Whisper's error rate

Open menelic opened this issue 1 year ago • 2 comments

Because of the error rate visualization and, above all, the speaker detection, your Whisper UI is better for research use than all the others I have tried. Please consider implementing Meta's MMS, which supports speech recognition and generation for over 1000 languages at a drastically reduced error rate compared to Whisper:


https://github.com/facebookresearch/fairseq/tree/main/examples/mms

https://ai.facebook.com/blog/multilingual-model-speech-recognition/

menelic, May 26 '23 10:05