ggml
ggml copied to clipboard
ggml : add NUMA support
See https://github.com/ggerganov/llama.cpp/pull/1556
The PR not only adds NUMA, but also improves the threading logic in ggml
which looks like brings significant speed-up.
Needs to be verified for correctness before merging