LightCompress
Maybe a bug in "allow_padding"
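https://github.com/ModelTC/llmc/blob/main/llmc/compression/quantization/quant.py
Line 629:
deficiency = self.group_size - tensor.shape[1] % self.group_size
Would tensor.shape[-1] be correct here instead of tensor.shape[1]? The two are the same for 2-D tensors, but they pick different axes once the tensor has more than two dimensions.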
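To illustrate the concern, here is a minimal sketch in plain Python that evaluates the quoted expression on shape tuples only. The helper name `deficiency`, the group size of 128, and the example shapes are assumptions made for illustration; they are not taken from llmc, and the sketch does not handle the case where the dimension is already a multiple of the group size.

```python
def deficiency(shape, group_size, dim):
    """Padding needed so that shape[dim] becomes a multiple of group_size.

    Same arithmetic as the expression quoted from quant.py, applied to a
    plain shape tuple. Hypothetical helper, not part of llmc.
    """
    return group_size - shape[dim] % group_size


group_size = 128  # assumed value, for illustration only

# 2-D weight (out_features, in_features): index 1 and index -1 are the same axis.
w_shape = (4096, 1000)
d_2d_idx1 = deficiency(w_shape, group_size, 1)
d_2d_last = deficiency(w_shape, group_size, -1)

# Hypothetical 3-D tensor (batch, seq_len, hidden): index 1 is now seq_len,
# while index -1 is still the hidden dimension.
x_shape = (2, 100, 1000)
d_3d_idx1 = deficiency(x_shape, group_size, 1)
d_3d_last = deficiency(x_shape, group_size, -1)

print(d_2d_idx1, d_2d_last)  # same value for the 2-D case
print(d_3d_idx1, d_3d_last)  # different values for the 3-D case
```

If the padding is meant to be applied along the last dimension, the 3-D case would compute the pad size from the wrong axis when shape[1] is used.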