openvino
openvino copied to clipboard
[CPU] Support weight-compression dt s8
Details:
- FC with symmetrically quantized/compressed weight may have i8 (instead of u8) as weight data type (it saves the zero-point subtraction cost), this change added support to such weight dt.
- oneDNN fork PR: https://github.com/openvinotoolkit/oneDNN/pull/249
Tickets:
- CVS-131042