tvm [QNN][ONNX-Frontend] Fix error of reading zero_point during per-channel quantization

Open MPolaris opened this issue 1 year ago • 0 comments

Quantization is usually done in one of two modes: per-channel or per-tensor. For per-channel quantization, zero_point and scale are 1-D arrays whose length equals the number of tensor channels. For per-tensor quantization, zero_point and scale are scalars. At present, when using onnx-qnn in the frontend, tvm seems to consider only per-tensor quantization. With this change, zero_point can be read the same way in both quantization modes without causing errors.
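To illustrate the difference between the two modes, here is a minimal pure-Python sketch of reading a quantization parameter that may be either a scalar or a 1-D array. This is not the TVM frontend code; the helper names `expand_qparam` and `dequantize` are hypothetical, and the channel axis is assumed to be the first axis of a 2-D tensor.

```python
from numbers import Number


def expand_qparam(param, num_channels):
    """Return one value per channel for a scalar or 1-D quantization parameter."""
    if isinstance(param, Number):
        # Per-tensor: a single scalar shared by every channel.
        return [param] * num_channels
    values = list(param)
    if len(values) == 1:
        # A one-element array is treated like a scalar (assumption).
        return values * num_channels
    if len(values) != num_channels:
        raise ValueError(
            f"expected 1 or {num_channels} values, got {len(values)}"
        )
    # Per-channel: one value per channel.
    return values


def dequantize(q_rows, scale, zero_point):
    """Dequantize a 2-D tensor whose first axis is the channel axis."""
    num_channels = len(q_rows)
    scales = expand_qparam(scale, num_channels)
    zero_points = expand_qparam(zero_point, num_channels)
    return [
        [(q - zp) * s for q in row]
        for row, s, zp in zip(q_rows, scales, zero_points)
    ]


q = [[10, 12], [20, 24]]
# Per-tensor: scalar scale and zero_point.
per_tensor = dequantize(q, 0.5, 10)
# Per-channel: one scale and one zero_point per channel.
per_channel = dequantize(q, [0.5, 0.25], [10, 20])
```

The point of the sketch is that a single read path handles both modes: a scalar is broadcast to every channel, while a 1-D array is used element by element.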

Jan 26 '24 13:01 MPolaris
