vllm
Missing comment explaining VDR variable in GGUF kernels
The comment is taken from llama.cpp: https://github.com/ggerganov/llama.cpp/blob/3d68f034dad53f0f27ad626b2732ef48fbcea4ee/ggml/src/ggml-cuda/vecdotq.cuh#L18