Max Krasnyansky
Results
2
issues of
Max Krasnyansky
Currently Windows ARM64 builds are not properly optimized, which results in low token rates on Windows ARM64 platforms such as the upcoming Snapgradon X-Elite & Plus. This update adds /...
review complexity : high
devops
Here is an attempt at reintroducing the original whole-graph profiler (LLAMA_PERF) with some additional features. Not ready for the merge into master but useful for profiling different models (on CPU)....
ggml