neural-compressor
neural-compressor copied to clipboard
support conv1d in quantization algorithms
Several models, such as LaMini-GPT, are utilizing this layer, but unfortunately, most of our algorithms do not currently support it. W8A8: SQ weight-only: RTN, TEQ
better support tranformers.conv1d and torch.conv1d both
conv1d support PR was merged