whisper.cpp
mmap support
As an experiment, it would be interesting to alter the ggml format a bit so that the file can be mmap'ed directly. Maybe this could reduce memory pressure on small systems.
Related: https://github.com/ggerganov/llama.cpp/issues/91
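To make the idea concrete, here is a minimal sketch of what reading weights straight out of a mapping could look like on a POSIX system. Everything named `demo_*` is hypothetical, and the layout (a small fixed header followed by raw float32 data) is an assumption for illustration, not the actual ggml format. The point is only that the tensor pointer refers into the mapped file, so no extra copy of the weights is allocated.

```c
#include <assert.h>
#include <fcntl.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <unistd.h>

/* Hypothetical on-disk layout (NOT the real ggml format):
 * an 8-byte header followed by n_elements raw float32 values. */
#define DEMO_MAGIC 0x64656d6fu /* arbitrary value chosen for this sketch */

typedef struct {
    uint32_t magic;
    uint32_t n_elements;
} demo_header;

typedef struct {
    void        *base;       /* start of the mapping */
    size_t       size;       /* length of the mapping in bytes */
    const float *data;       /* points into the mapping, no copy made */
    uint32_t     n_elements;
} demo_mapping;

/* Write a file in the hypothetical layout. Returns 0 on success. */
int demo_write(const char *path, const float *values, uint32_t n) {
    assert(path != NULL);
    FILE *f = fopen(path, "wb");
    if (!f) return -1;
    demo_header h = { DEMO_MAGIC, n };
    int ok = fwrite(&h, sizeof h, 1, f) == 1 &&
             fwrite(values, sizeof(float), n, f) == n;
    return (fclose(f) == 0 && ok) ? 0 : -1;
}

/* Map the file read-only and expose the float data in place. */
int demo_map(const char *path, demo_mapping *out) {
    assert(path != NULL && out != NULL);
    int fd = open(path, O_RDONLY);
    if (fd < 0) return -1;

    struct stat st;
    if (fstat(fd, &st) != 0 || (size_t)st.st_size < sizeof(demo_header)) {
        close(fd);
        return -1;
    }
    size_t size = (size_t)st.st_size;

    void *base = mmap(NULL, size, PROT_READ, MAP_PRIVATE, fd, 0);
    close(fd); /* the mapping stays valid after the descriptor is closed */
    if (base == MAP_FAILED) return -1;

    demo_header h;
    memcpy(&h, base, sizeof h);
    if (h.magic != DEMO_MAGIC ||
        sizeof h + (size_t)h.n_elements * sizeof(float) > size) {
        munmap(base, size);
        return -1;
    }

    out->base = base;
    out->size = size;
    /* The mapping is page-aligned and the header is 8 bytes,
     * so the data is suitably aligned for float. */
    out->data = (const float *)((const char *)base + sizeof h);
    out->n_elements = h.n_elements;
    return 0;
}

void demo_unmap(demo_mapping *m) {
    if (m && m->base) {
        munmap(m->base, m->size);
        m->base = NULL;
        m->data = NULL;
    }
}
```

With a read-only mapping like this, pages are faulted in on demand and the kernel can drop them under memory pressure, since they are backed by the file. Whether that actually helps on small systems would need measuring. It also requires the on-disk tensor data to be laid out (and aligned) exactly as it is used in memory, which is presumably why the format would need altering.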