whisper-streamlit
whisper-streamlit copied to clipboard
For better transcription in more languages, Implement Massively Multilingual Speech - Meta's Open Source model with less than half of Whispers error rate
Because of the error rate viz and above al speaker detection your whisper ui is better for research use than all the others I have tried. Please consider implementing Meta's MMS with speech recognition and generation support for over 1000 languages at a drastically reduced error rate compared to Whisper:
https://github.com/facebookresearch/fairseq/tree/main/examples/mms
https://ai.facebook.com/blog/multilingual-model-speech-recognition/