More speech-to-text models for dictation purposes

Open • Mk-N opened this issue 4 months ago • 4 comments

Currently only OpenAI Whisper models can be chosen. Furthermore, the Whisper turbo model cannot be chosen, and it is unclear whether the large model refers to large v1, v2 or v3.

A greater choice of models from different companies would be nice to have.

Mk-N avatar Aug 17 '25 10:08 Mk-N

The list of models comes from this part of the Whisper README:

https://github.com/openai/whisper?tab=readme-ov-file#available-models-and-languages

Jeffser avatar Aug 18 '25 16:08 Jeffser

Hi Jeffser, I don't see German in Alpaca. Is it a bug?

Thank you!

linuxkernel94 avatar Aug 18 '25 22:08 linuxkernel94

Edit: I don't see it in text-to-speech, but I can see it in speech-to-text 😅

linuxkernel94 avatar Aug 18 '25 22:08 linuxkernel94

Hi @linuxkernel94, unfortunately Kokoro, the model set I'm using for voices, doesn't have any voices in German.

I also don't include Japanese and Chinese because they require additional libraries that are way too big.

Jeffser avatar Aug 18 '25 23:08 Jeffser