GGUF quantization meta-data format

Open mobicham opened this issue 1 year ago • 2 comments

Hello!

Are there some resources that explain how the quantized parameters are structured in a GGUF file? We are interested in porting HQQ-quantized models into GGUF format, but in order to do that, we need to know exactly how it is stored. We basically need to know:

The bitpacking logic
axis along which quantization is done
group-sizes associated with different quant types

Thanks!

Apr 14 '24 12:04 mobicham

Hi, you would better have a look at llama.cpp :

https://github.com/ggerganov/llama.cpp/blob/f184dd920852d6d372b754f871ee06cfe6f977ad/llama.cpp#L13599

Apr 14 '24 13:04 phymbert

@mobicham here is the spec for GGUF for you to use: https://github.com/ggerganov/ggml/blob/master/docs/gguf.md

Apr 15 '24 19:04 crimson-knight