ggml
ggml copied to clipboard
Why perform such operations on k and v as shown in the above diagram?
Why perform such operations on k and v as shown in the above diagram?
In order to copy and persist the current key and values (Kcur and Vcur) to the kv cache.