aimet
issue about the exported onnx model
I use the AIMET PTQ to quantize the CLIP text model.
But I encounter this error when using qnn-onnx-converter to convert the ONNX model: [KeyError: 'Graph has no buffer /text_model/encoder/layers.0/layer_norm1/Constant_output_0, referred to as input for text_model.encoder.layers.0.layer_norm1#2']
When I inspect the exported ONNX model with Netron, I find that the second input of the Pow node has disappeared. It is just a constant node with value 2.0.
By the way, I am using AIMET commit 35e588226f02 with the torch-gpu variant.
+1. I got this error with AIMET 1.29.0 too.
Hi @czy2014hust, can you please tell us which QNN version you are using to convert ONNX models?
QNN version:2.16.4.231110
+1
+1. Any suggestions will be appreciated.
I found that the problem is solved by exporting with AIMET 1.25.0.
Thanks for sharing your QNN version. I was under the impression that this issue is resolved with the QNN version you are using, but let me check and get back to you on this.
Meanwhile, you can use Pytorch 1.9 variant of AIMET 1.29 release. https://github.com/quic/aimet/releases/tag/1.29.0
Background: The root cause of this issue is that PyTorch 1.13 applies a bunch of optimizations to the ONNX graph and produces more heavily optimized graphs when the model is set to EVAL mode. This includes constant folding, Conv+BN fusion, etc.
In this case, the optimization found identical values for the LayerNorm parameters and replaced them with a single value, so the ONNX graph ends up with only one initializer, which breaks the QNN conversion step.
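To make the failure mode above concrete, here is a toy illustration (not AIMET or QNN code; all buffer names are made up) of why deduplicating identical initializers breaks a converter that resolves node inputs strictly by buffer name:

```python
# Toy model of the failure: when two identical constants are folded into
# one initializer, a node that still references the dropped buffer name
# hits the same kind of "Graph has no buffer" KeyError.

def fold_identical_constants(initializers):
    """Keep only the first initializer per distinct value, mimicking
    the constant-folding/dedup optimization applied at ONNX export."""
    seen = {}    # value -> surviving buffer name
    folded = {}  # surviving initializers
    remap = {}   # dropped buffer name -> surviving buffer name
    for name, value in initializers.items():
        if value in seen:
            remap[name] = seen[value]  # this buffer is dropped
        else:
            seen[value] = name
            folded[name] = value
    return folded, remap

def resolve_input(buffers, input_name):
    """A converter that looks up inputs by name fails once the
    referenced buffer has been folded away."""
    if input_name not in buffers:
        raise KeyError(f"Graph has no buffer {input_name}")
    return buffers[input_name]

initializers = {
    "layers.0/layer_norm1/Constant_output_0": 2.0,  # Pow exponent
    "layers.1/layer_norm1/Constant_output_0": 2.0,  # identical value
}
folded, remap = fold_identical_constants(initializers)
# The second LayerNorm's Pow node still references the dropped buffer:
try:
    resolve_input(folded, "layers.1/layer_norm1/Constant_output_0")
except KeyError as e:
    print(e)  # the missing-buffer error the converter reports
```

Disabling the simplifier keeps both initializers in the graph, so every name lookup still succeeds.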
Thank you for your reply. I met the same issue with the latest QNN version, 2.17.0.231124.
Hi, does this issue still exist in AIMET 1.29?
You need to disable the ONNX simplifier by setting onnx_utils.simplify_onnx_model = False.
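For reference, a sketch of where this flag goes, assuming the aimet_torch.onnx_utils module path and a standard QuantizationSimModel export call (adjust names and paths to your AIMET version; this is not verified against every release):

```python
# Sketch: disable the ONNX simplifier (and node renaming) before exporting
# the QuantizationSimModel, so identical LayerNorm constants are not folded
# into a single initializer.
from aimet_torch import onnx_utils

onnx_utils.update_all_onnx_nodes_name = False  # keep original node names
onnx_utils.simplify_onnx_model = False         # skip the simplifier pass

# qsim is your QuantizationSimModel instance; dummy_input matches the
# model's expected input (both assumed to exist in your script).
qsim.export(path="./output", filename_prefix="clip_text_model",
            dummy_input=dummy_input)
```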
May I ask how this problem was ultimately resolved?
@czy2014hust
I hit this error with aimet_torch 1.31.0 as well. Any suggestions to solve this problem?
Name: aimet-torch
Version: torch-gpu-1.31.0
Summary: AIMET torch Package
Home-page: https://github.com/quic/aimet/releases/download/torch-gpu_1.31.0
Author: Qualcomm Innovation Center, Inc.
Author-email: [email protected]
License: NOTICE.txt
You need to disable the ONNX simplifier by setting onnx_utils.simplify_onnx_model = False.
Thank you for your reply! I've solved it following your tips. By the way, the code is in file xx/aimet_quantsim.py:

def export_quantsim_model(qsim, output_path, dummy_input, filename_prefix, verbose=False,
                          opset_version=11, use_external_data_format=False,
                          input_names=None, output_names=None):
    onnx_utils.update_all_onnx_nodes_name = False
    onnx_utils.simplify_onnx_model = False