sitabulaixizawaluduo

Results 5 issues of sitabulaixizawaluduo

### OpenVINO Version 2024.0.0 ### Operating System Ubuntu 22.04 (LTS) ### Device used for inference CPU ### OpenVINO installation Build from source ### Programming Language Python ### Hardware Architecture x86...

category: CPU
performance
support_request

I tested the mit-han-lab/opt-6.7b-smoothquant model and the opt-6.7b model from HuggingFace. The perplexity (ppl) measured on the WikiText-2 dataset was 20.65 and 10.92, respectively. The tests were conducted on an A30...

### OpenVINO Version 2024.0.0 ### Operating System Ubuntu 22.04 (LTS) ### Device used for inference CPU ### OpenVINO installation Build from source ### Programming Language Python ### Hardware Architecture x86...

bug
category: CPU
performance
support_request

INT8 quantization is basically lossless at the moment, and although AWQ is good in terms of accuracy and performance, INT8 is a better choice in some scenarios where accuracy...

### Checklist - [x] 1. I have searched related issues but cannot get the expected help. - [x] 2. The bug has not been fixed in the latest version. -...

deepseek