llama.cpp

Slight confusion in quantize.cpp

Open FNsi opened this issue 1 year ago • 2 comments

Lines 13-20 in quantize.cpp:

The printf output does not match the sequence in current master e0305ea, and ggml.h and llama.h use different numbers to refer to the quantization types.

FNsi, Apr 21 '23 03:04

That master commit moves Q8_0 from 5 to 6 in ggml.h (lines 208 to 209).

That leaves me confused about whether 6 means q4_3 or q8_0.

After reading more of the code, I think 6 means q4_3 here, the same as the printf shows.

FNsi, Apr 21 '23 03:04

The LLAMA_FTYPE_MOSTLY numbers in llama.h do not match the GGML_TYPE numbers in ggml.h.
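To illustrate the kind of mismatch I mean, here is a minimal sketch with two separate enums. The names `TensorType`, `FileType`, and `tensor_type_for`, and all the numeric values, are assumptions made up for this example; they are not copied from ggml.h or llama.h. The point is only that the same integer can mean different things in each enum, so an explicit mapping is needed instead of reusing the number.

```cpp
#include <stdexcept>

// Hypothetical per-tensor type enum, in the spirit of GGML_TYPE.
// Values are illustrative assumptions, not taken from ggml.h.
enum class TensorType : int {
    F32  = 0,
    F16  = 1,
    Q4_0 = 2,
    Q4_1 = 3,
    Q4_2 = 4,
    Q4_3 = 5,
    Q8_0 = 6,
};

// Hypothetical per-file type enum, in the spirit of LLAMA_FTYPE_MOSTLY.
// It has an extra entry at 4, so later values drift away from TensorType.
enum class FileType : int {
    ALL_F32              = 0,
    MOSTLY_F16           = 1,
    MOSTLY_Q4_0          = 2,
    MOSTLY_Q4_1          = 3,
    MOSTLY_Q4_1_SOME_F16 = 4,
    MOSTLY_Q4_2          = 5,
    MOSTLY_Q4_3          = 6,
};

// Map a file type to the tensor type used for most weights.
// The mapping is explicit because the integer values do not line up.
TensorType tensor_type_for(FileType ftype) {
    switch (ftype) {
        case FileType::ALL_F32:              return TensorType::F32;
        case FileType::MOSTLY_F16:           return TensorType::F16;
        case FileType::MOSTLY_Q4_0:          return TensorType::Q4_0;
        case FileType::MOSTLY_Q4_1:          return TensorType::Q4_1;
        case FileType::MOSTLY_Q4_1_SOME_F16: return TensorType::Q4_1;
        case FileType::MOSTLY_Q4_2:          return TensorType::Q4_2;
        case FileType::MOSTLY_Q4_3:          return TensorType::Q4_3;
    }
    throw std::invalid_argument("unknown file type");
}
```

In this sketch, the file type with value 6 maps to the tensor type with value 5, while tensor type 6 is something else entirely. If the real headers work like this, then the number printed by quantize.cpp would be a file-type number and should not be read against the GGML_TYPE list.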

FNsi, Apr 21 '23 04:04