aimet How to get a Qoperator format ONNX model after quantization?

How to get a Qoperator format ONNX model after quantization?

Open JiliangNi opened this issue 1 year ago • 4 comments

How to get a Qoperator format ONNX model after quantization?

Mar 18 '24 02:03 JiliangNi

Hi @JiliangNi

I'm not aware of Qoperator support in AIMET. However, you can obtain QDQ format in your ONNX using use_embedded_encodings=true with AIMET's ONNX export feature. If you're unfamiliar with QDQ format, you can find more information in this link

PS: Please note that AIMET QDQ format is supported only for int8 quantization (W8A8) due to limitation in the ONNX opset version related to the Torch version (1.13) of AIMET

Mar 18 '24 19:03 e-said

So AIMET does not support Qoperator format in ONNX, right?

Is there any method which could convert QDQ to Qoperator in ONNX?

Mar 19 '24 02:03 JiliangNi

@JiliangNi, we don't have the support to convert currently.

Mar 25 '24 16:03 quic-mangal

So AIMET does not support Qoperator format in ONNX, right?

Is there any method which could convert QDQ to Qoperator in ONNX?

Hi @JiliangNi, have you found the method can convert QDQ to QOperator? Thank you.

Jun 23 '24 12:06 hoangtv2000

aimet aimet copied to clipboard

How to get a Qoperator format ONNX model after quantization?

aimet
aimet copied to clipboard