Max Krasnyansky

2 issues by Max Krasnyansky

Currently, Windows ARM64 builds are not properly optimized, which results in low token rates on Windows ARM64 platforms such as the upcoming Snapdragon X-Elite & Plus. This update adds /...

review complexity : high
devops

Here is an attempt at reintroducing the original whole-graph profiler (LLAMA_PERF) with some additional features. It is not ready to merge into master, but it is useful for profiling different models (on CPU)....

ggml