Christian Huettig
Results
11
comments of
Christian Huettig
I'm also using Threadripper with active NUMA and see even a larger boost, almost 80% faster with llama.cpp when using the same model with --numa=distribute . Started experimenting when the...