Christian Huettig

Results: 11 comments by Christian Huettig

I'm also using a Threadripper with NUMA active and see an even larger boost: almost 80% faster with llama.cpp when running the same model with --numa=distribute. Started experimenting when the...