Johannes Gäßler
Johannes Gäßler
Does it work with a plain text file?
I did not explicitly check the effect of FP16/BF16 as an intermediary but when using them directly I basically found no relevant differences: https://github.com/ggerganov/llama.cpp/pull/7150 . And because the FP16 vs....
The design philosophy of ggml/llama.cpp is **not** to use external dependencies if at all possible. I was recently informed by an NVIDIA engineer that the way to go for tensor...
Good catch with the `&`. Does the compilation work if you just remove the `const`?
>Write more information on the Wikipedia article about how llama.cpp works, citing reliable secondary and tertiary sources. I don't think there are any (for the technical aspects). For that primary...