ggml
ggml copied to clipboard
Profiling oddity - why so slow sometimes?
Often when I attempt to profile the llama.cpp main process using Instruments, the process drags to a crawl with a vast majority of the time spent in ggml_graph_compute_thread:
I cannot reliably reproduce the condition. I was wondering if anyone had seen similar issues and whether there might be a workaround.