Maybe a bug in "allow_padding"

Open pengyao96 opened this issue 9 months ago • 0 comments

https://github.com/ModelTC/llmc/blob/main/llmc/compression/quantization/quant.py Line 629: deficiency = self.group_size - tensor.shape[1] % self.group_size tensor.shape[-1] is ok?

Apr 08 '25 12:04 pengyao96