neural-compressor support conv1d in quantization algorithms

support conv1d in quantization algorithms

Open wenhuach21 opened this issue 1 year ago • 1 comments

Several models, such as LaMini-GPT, are utilizing this layer, but unfortunately, most of our algorithms do not currently support it. W8A8: SQ weight-only: RTN, TEQ

better support tranformers.conv1d and torch.conv1d both

Oct 23 '23 02:10 wenhuach21

conv1d support PR was merged

May 20 '24 08:05 thuang6

neural-compressor neural-compressor copied to clipboard

support conv1d in quantization algorithms

neural-compressor
neural-compressor copied to clipboard