LightCompress
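Does get_float_qparams ignore the quantization granularity?
In get_float_qparams, the resulting tensor and scales always have the same shape. Does that mean the quantization granularity setting (per-tensor/group/channel) has no effect?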
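To make the expectation concrete, here is a minimal pure-Python sketch of how many scales each granularity would normally produce for a 2-D weight. This is not llmc's code: `absmax_scales`, its arguments, and the abs-max rule are hypothetical names and choices made for illustration, and `qmax=448.0` assumes a float8 E4M3 target.

```python
def absmax_scales(weight, granularity, group_size=None, qmax=448.0):
    """Return (scales, shape) for a 2-D weight given as a list of rows.

    Hypothetical helper for illustration only, not part of llmc.
    qmax=448.0 is an assumption (largest finite float8 E4M3 value).
    """
    rows, cols = len(weight), len(weight[0])
    if granularity == "per_tensor":
        # One block covering the whole tensor -> a single scale.
        blocks = [[v for row in weight for v in row]]
        shape = (1,)
    elif granularity == "per_channel":
        # One block per output channel (row) -> one scale per row.
        blocks = [list(row) for row in weight]
        shape = (rows, 1)
    elif granularity == "per_group":
        if not group_size or cols % group_size != 0:
            raise ValueError("group_size must divide the number of columns")
        # Each row is split into contiguous groups of group_size columns.
        blocks = [row[i:i + group_size]
                  for row in weight
                  for i in range(0, cols, group_size)]
        shape = (rows, cols // group_size)
    else:
        raise ValueError(f"unknown granularity: {granularity}")
    # Abs-max scaling: the largest magnitude in each block maps to qmax.
    scales = [max(abs(v) for v in block) / qmax for block in blocks]
    return scales, shape


weight = [[1.0, -2.0, 3.0, -4.0],
          [0.5, 0.25, -8.0, 2.0]]
for mode, gs in [("per_tensor", None), ("per_channel", None), ("per_group", 2)]:
    scales, shape = absmax_scales(weight, mode, group_size=gs)
    print(mode, shape, len(scales))
```

With this 2x4 weight, the three modes give 1, 2, and 4 scales respectively. If the scales coming out of get_float_qparams have the same shape regardless of the configured granularity, that would suggest the setting is not being applied.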
Another bug
From https://github.com/ModelTC/llmc/blob/main/llmc/compression/quantization/quant.py#L979 to the end of the init function, the code should not be indented.
@chengtao-lv