Johannes Gäßler

235 comments by Johannes Gäßler

Does it work with a plain text file?

I did not explicitly check the effect of FP16/BF16 as an intermediary, but when using them directly I found essentially no relevant differences: https://github.com/ggerganov/llama.cpp/pull/7150. And because the FP16 vs....

The design philosophy of ggml/llama.cpp is **not** to use external dependencies if at all possible. I was recently informed by an NVIDIA engineer that the way to go for tensor...

Good catch with the `&`. Does the compilation work if you just remove the `const`?

> Write more information on the Wikipedia article about how llama.cpp works, citing reliable secondary and tertiary sources.

I don't think there are any (for the technical aspects). For that primary...