
Question about converting a quantized ONNX model to TensorRT

Open • Nedae opened this issue 4 years ago • 1 comment

I am trying to convert an already quantized ONNX model to TensorRT. ONNX version: 1.7.0. TensorRT version: 7.0.0.11.

When I try to parse my quantized ONNX network, I get the following error: In node 1 (parseGraph): UNSUPPORTED_NODE: No importer registered for op: QLinearConv

The list of ONNX operators supported by TensorRT at https://github.com/onnx/onnx-tensorrt/blob/master/operators.md shows that QLinearConv is not supported.

Is there any guideline on how to convert a quantized ONNX model to TensorRT?

Nedae, Nov 05 '20 20:11

Hi @Nedae, we only support QuantizeLinear and DequantizeLinear operators in TensorRT 7.2. We are planning to extend support and provide better tooling for prequantized networks in the next TensorRT release.

kevinch-nv, Apr 23 '21 21:04
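
For readers who hit the same error, here is a minimal sketch of a pre-check that lists the quantization operators in a graph that fall outside the two named in the reply above (QuantizeLinear and DequantizeLinear). The helper names and the example op list are hypothetical, and the supported set is an assumption taken from that reply for TensorRT 7.2, not from the parser's source; check operators.md for the version you actually use.

```python
from collections import Counter

# Assumption: per the reply above, these are the only quantization
# operators the TensorRT 7.2 ONNX parser can import.
SUPPORTED_QUANT_OPS = {"QuantizeLinear", "DequantizeLinear"}

# Quantization-related operators defined by the ONNX standard.
ONNX_QUANT_OPS = {
    "QuantizeLinear", "DequantizeLinear", "DynamicQuantizeLinear",
    "QLinearConv", "QLinearMatMul", "ConvInteger", "MatMulInteger",
}


def unsupported_quant_ops(op_types):
    """Count quantization ops in op_types that are outside SUPPORTED_QUANT_OPS."""
    return Counter(
        op for op in op_types
        if op in ONNX_QUANT_OPS and op not in SUPPORTED_QUANT_OPS
    )


def op_types_from_model(path):
    """Return the op type of every top-level node in an ONNX file."""
    import onnx  # imported lazily so the helper above works without it
    model = onnx.load(path)
    return [node.op_type for node in model.graph.node]


if __name__ == "__main__":
    # Hypothetical op list standing in for a prequantized model.
    example = ["QuantizeLinear", "QLinearConv", "QLinearConv", "DequantizeLinear"]
    for op, count in unsupported_quant_ops(example).items():
        print(f"{op}: {count} node(s) outside the assumed supported set")
```

With a real file, unsupported_quant_ops(op_types_from_model("model.onnx")) gives the same report; "model.onnx" is a placeholder path, and the scan covers only top-level nodes, not subgraphs.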