llama.cpp: A little confused by quantize.cpp

Open FNsi opened this issue 1 year ago • 2 comments

Lines 13-20 in quantize.cpp:

The printf output does not match the sequence in current master (e0305ea), and ggml.h and llama.h use different numbers to refer to the quantization types.

Apr 21 '23 03:04 FNsi

That master commit moved Q8_0 from 5 to 6 in ggml.h (lines 208 to 209).

That left me unsure whether 6 means q4_3 or q8_0.

After reading more of the code, I think 6 selects q4_3, the same as the printf shows.

Apr 21 '23 03:04 FNsi

The LLAMA_FTYPE_MOSTLY values in llama.h do not match the GGML_TYPE numbers in ggml.h.
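
To show what I mean, here is a minimal sketch of two independently numbered enums. All the `demo_` names are hypothetical stand-ins, not the real definitions from ggml.h or llama.h. Only two values come from this thread (tensor type Q8_0 is 6, and file type 6 is q4_3); the value 5 for the q4_3 tensor type is an assumption made for illustration.

```cpp
#include <cassert>
#include <cstring>

// Hypothetical stand-ins, NOT the real ggml.h / llama.h enums.
enum demo_tensor_type {
    DEMO_TYPE_Q4_3 = 5, // assumed value, for illustration only
    DEMO_TYPE_Q8_0 = 6, // from the thread: Q8_0 moved from 5 to 6
};

enum demo_file_type {
    DEMO_FTYPE_MOSTLY_Q4_3 = 6, // from the thread: file type 6 is q4_3
};

// Name a tensor-level type number.
inline const char * demo_tensor_type_name(int t) {
    switch (t) {
        case DEMO_TYPE_Q4_3: return "q4_3";
        case DEMO_TYPE_Q8_0: return "q8_0";
        default:             return "unknown";
    }
}

// Name a file-level type number (the kind passed on the command line).
inline const char * demo_file_type_name(int f) {
    switch (f) {
        case DEMO_FTYPE_MOSTLY_Q4_3: return "q4_3";
        default:                     return "unknown";
    }
}

// Translate a file-level type into a tensor-level type explicitly,
// instead of assuming the two enums share the same numbers.
inline int demo_tensor_type_for_file_type(int f) {
    switch (f) {
        case DEMO_FTYPE_MOSTLY_Q4_3: return DEMO_TYPE_Q4_3;
        default:                     return -1; // no known mapping
    }
}

// The same number names different things depending on which enum reads it.
inline void demo_check() {
    assert(std::strcmp(demo_file_type_name(6), "q4_3") == 0);
    assert(std::strcmp(demo_tensor_type_name(6), "q8_0") == 0);
}
```

Under these assumptions, the number 6 reads as q4_3 when treated as a file type and as q8_0 when treated as a tensor type, so an explicit mapping function avoids mixing the two up.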

Apr 21 '23 04:04 FNsi