whisper.cpp
Attempt to improve threading in ggml
ref #200
Instead of creating and joining new threads for each graph compute, we create a thread pool once and keep its threads alive. Between computes, the threads wait on a condition variable rather than being joined.
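For illustration, the general pattern can be sketched as below. This is a minimal, standalone C++ sketch of a persistent pool that parks its workers on a condition variable. It is not the code in this PR and not ggml's actual data structures: `WorkerPool`, `run`, `size` and all member names are hypothetical.

```cpp
#include <condition_variable>
#include <functional>
#include <mutex>
#include <thread>
#include <vector>

// Hypothetical illustration of a persistent thread pool; not ggml's implementation.
class WorkerPool {
public:
    explicit WorkerPool(int n_threads) {
        // Threads are created once and reused for every call to run().
        for (int i = 0; i < n_threads; ++i) {
            workers_.emplace_back([this, i] { worker_loop(i); });
        }
    }

    ~WorkerPool() {
        {
            std::lock_guard<std::mutex> lock(mutex_);
            stop_ = true;
        }
        cv_work_.notify_all();
        // Threads are joined only here, when the pool itself is destroyed.
        for (auto & t : workers_) {
            t.join();
        }
    }

    int size() const { return (int) workers_.size(); }

    // Run task(i) on every worker i and block until all of them have finished.
    void run(const std::function<void(int)> & task) {
        std::unique_lock<std::mutex> lock(mutex_);
        task_    = &task;
        pending_ = (int) workers_.size();
        ++generation_;                 // signals that a new batch of work exists
        cv_work_.notify_all();
        cv_done_.wait(lock, [this] { return pending_ == 0; });
        task_ = nullptr;
    }

private:
    void worker_loop(int index) {
        unsigned long seen = 0;        // last generation this worker has processed
        std::unique_lock<std::mutex> lock(mutex_);
        for (;;) {
            // Sleep until there is new work or the pool is shutting down.
            cv_work_.wait(lock, [&] { return stop_ || generation_ != seen; });
            if (stop_) {
                return;
            }
            seen = generation_;
            const std::function<void(int)> * task = task_;
            lock.unlock();
            (*task)(index);            // do the work without holding the lock
            lock.lock();
            if (--pending_ == 0) {
                cv_done_.notify_one(); // last worker wakes the caller in run()
            }
        }
    }

    std::mutex mutex_;
    std::condition_variable cv_work_;
    std::condition_variable cv_done_;
    const std::function<void(int)> * task_ = nullptr;
    unsigned long generation_ = 0;
    int  pending_ = 0;
    bool stop_    = false;
    std::vector<std::thread> workers_;
};
```

With this sketch, a caller would construct `WorkerPool pool(4);` once and then call `pool.run(...)` for each graph compute, passing a callable that receives the worker index. Note that every wake-up and completion goes through the mutex and a condition variable, so the per-compute cost is a sleep/wake round trip per worker rather than a thread create/join.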
Surprisingly, this does not seem to improve performance. It actually makes it worse. I'm not sure yet what is going on.