whisper.el
[FR] Use `server` to make inference faster
whisper.cpp ships with a `server` example. Wouldn't using that be faster than loading the model from scratch for every request?
Doing this should also be much easier to implement than https://github.com/natrys/whisper.el/issues/22.