
How to get a Qoperator format ONNX model after quantization?

Open JiliangNi opened this issue 11 months ago • 4 comments

How to get a Qoperator format ONNX model after quantization?

JiliangNi avatar Mar 18 '24 02:03 JiliangNi

Hi @JiliangNi

I'm not aware of QOperator support in AIMET. However, you can get a QDQ-format ONNX model by passing use_embedded_encodings=True to AIMET's ONNX export. If you're unfamiliar with the QDQ format, you can find more information in this link
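As a rough sketch of the export call described above: the helper below assumes `sim` is an `aimet_torch` `QuantizationSimModel` whose encodings have already been computed, and that its `export()` method accepts `path`, `filename_prefix`, `dummy_input` and `use_embedded_encodings`. The helper name `export_qdq_onnx`, the output directory and the prefix are hypothetical choices for illustration, so check them against the AIMET version you have installed.

```python
from pathlib import Path


def export_qdq_onnx(sim, dummy_input, out_dir="./export", prefix="model_qdq"):
    """Export a calibrated sim model with Q/DQ nodes embedded in the ONNX graph.

    Assumption: `sim` behaves like an aimet_torch QuantizationSimModel and
    sim.compute_encodings(...) has already been called on it.
    """
    # Make sure the output directory exists before asking AIMET to write to it.
    Path(out_dir).mkdir(parents=True, exist_ok=True)

    # use_embedded_encodings=True asks AIMET to place QuantizeLinear /
    # DequantizeLinear nodes in the exported graph (QDQ format).
    sim.export(
        path=out_dir,
        filename_prefix=prefix,
        dummy_input=dummy_input,
        use_embedded_encodings=True,
    )

    # The exact output file name is decided by AIMET, so only the
    # directory is returned here.
    return Path(out_dir)
```

The helper takes the sim object as an argument rather than building it, so it makes no assumption about how the model was created or calibrated.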

PS: Please note that AIMET's QDQ export is supported only for int8 quantization (W8A8, i.e. 8-bit weights and 8-bit activations), due to a limitation in the ONNX opset version tied to the Torch version (1.13) that AIMET uses.

e-said avatar Mar 18 '24 19:03 e-said

So AIMET does not support Qoperator format in ONNX, right?

Is there any method which could convert QDQ to Qoperator in ONNX?

JiliangNi avatar Mar 19 '24 02:03 JiliangNi

@JiliangNi, we don't currently support this conversion.
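For readers unsure how the two formats differ, here is a small pure-Python sketch of the arithmetic, not an AIMET or ONNX Runtime API. It assumes symmetric per-tensor int8 quantization (zero point 0), and all function names are made up for illustration. In QDQ form the operator stays in floating point and is wrapped in quantize/dequantize pairs; in QOperator form a single operator consumes and produces integer codes.

```python
def quantize(x, scale, zero_point=0, qmin=-128, qmax=127):
    """Map a real value to an int8 code (round half to even, then clamp)."""
    q = round(x / scale) + zero_point
    return max(qmin, min(qmax, q))


def dequantize(q, scale, zero_point=0):
    """Map an int8 code back to the real value it represents."""
    return (q - zero_point) * scale


def dot_qdq(xs, ws, sx, sw, sy):
    """QDQ style: a float dot product surrounded by Q/DQ pairs."""
    xq = [dequantize(quantize(x, sx), sx) for x in xs]
    wq = [dequantize(quantize(w, sw), sw) for w in ws]
    y = sum(a * b for a, b in zip(xq, wq))
    return dequantize(quantize(y, sy), sy)


def dot_qoperator(x_codes, w_codes, sx, sw, sy):
    """QOperator style: integer codes in, integer code out."""
    acc = sum(a * b for a, b in zip(x_codes, w_codes))  # integer accumulation
    return quantize(acc * sx * sw, sy)  # requantize to the output scale


# Example inputs; scales are powers of two so the arithmetic is exact.
sx, sw, sy = 0.5, 0.25, 0.5
xs, ws = [1.0, -2.5, 3.0], [0.5, 0.75, -1.0]
x_codes = [quantize(x, sx) for x in xs]
w_codes = [quantize(w, sw) for w in ws]

qdq_result = dot_qdq(xs, ws, sx, sw, sy)
qop_result = dequantize(dot_qoperator(x_codes, w_codes, sx, sw, sy), sy)
```

With these inputs both paths represent the same value, which is why a QDQ-to-QOperator conversion is conceptually a matter of fusing each Q/DQ-wrapped operator into its integer counterpart. A real converter also has to handle zero points, per-channel scales and operator coverage, which this sketch ignores.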

quic-mangal avatar Mar 25 '24 16:03 quic-mangal

So AIMET does not support Qoperator format in ONNX, right?

Is there any method which could convert QDQ to Qoperator in ONNX?

Hi @JiliangNi, have you found a method to convert QDQ to QOperator? Thank you.

hoangtv2000 avatar Jun 23 '24 12:06 hoangtv2000