llama.cpp
llama.cpp copied to clipboard
Little confuse in quantize.cpp
Line 13-20 in quanzize.cpp,
The printf function is not match the sequnce in current master e0305ea. While ggml.h and llama.h are using different numbers to call quantisations.
that master move Q8_0 from 5 to 6 in ggml.h line 208 to 209.
While let me confuse to see that 6 is q4_3 or q8_0.
After reading more code I think the 6 is doing q4_3 as the same as the printf showed.
the llama.h LLAMA_FTYPE_MOSTLY Has the numbers not matching to ggml.h GGML_TYPE