neural-compressor icon indicating copy to clipboard operation
neural-compressor copied to clipboard

support conv1d in quantization algorithms

Open wenhuach21 opened this issue 1 year ago • 1 comments

Several models, such as LaMini-GPT, are utilizing this layer, but unfortunately, most of our algorithms do not currently support it. W8A8: SQ weight-only: RTN, TEQ

better support tranformers.conv1d and torch.conv1d both

wenhuach21 avatar Oct 23 '23 02:10 wenhuach21

conv1d support PR was merged

thuang6 avatar May 20 '24 08:05 thuang6