TensorRT Convert a quantized conv3d model failed

Description

I'm using pytorch quantization toolkit to quantize my model, which has some conv3d module. The QAT procedure is OK. But when i use trtexec to convert the onnx with Q/DQ pairs to engine, it failed! I have written some simple code to reproduce the error.

Environment

TensorRT Version: 8.2.5.1 NVIDIA GPU: TITAN RTX NVIDIA Driver Version: 470.103.01 CUDA Version: 11.4 CUDNN Version: Operating System: ubuntu 20.04 Python Version (if applicable): 3.8.10 Tensorflow Version (if applicable): PyTorch Version (if applicable): 1.9.1+cu102 Baremetal or Container (if so, version):

Relevant Files

import torch
import torch.nn as nn
from pytorch_quantization import quant_modules
import pytorch_quantization.nn as quant_nn
quant_modules.initialize()

model = nn.Conv3d(3, 16, kernel_size=(3, 3, 3), stride=(1, 1,1), padding=(1, 1, 1))
quant_nn.TensorQuantizer.use_fb_fake_quant = True
torch.onnx.export(model, torch.randn(1, 3, 20, 20, 20), 'conv3d_quant.onnx', opset_version=13)

Steps To Reproduce

use above code to generate a conv3d onnx with q/dq pair, it should look like this. 2022-07-06 20-23-08 的屏幕截图

Then use

trtexec --onnx=conv3d_quant.onnx --int8 --noTF32 --saveEngine=conv3d_quant.engine

it will throw a error 2022-07-06 20-25-50 的屏幕截图

I guess tensorrt doesn't support conv3d onnx with q/dq pair？

Jul 06 '22 12:07 YohannXu

I can't export the onnx using your script, can you share the onnx model here?

$ python3 2125.py
Traceback (most recent call last):
  File "2125.py", line 8, in <module>
    torch.onnx.export(model. torch.randn(1, 3, 20, 20, 20), 'conv3d_quant.onnx', opset_version=13)
  File "/home/scratch.zeroz_sw/miniconda3/envs/transformer/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1185, in __getattr__
    raise AttributeError("'{}' object has no attribute '{}'".format(
AttributeError: 'QuantConv3d' object has no attribute 'torch'

Jul 06 '22 14:07 zerollzeng

cc @ttyio

Jul 06 '22 14:07 zerollzeng

@zerollzeng i fixed code above, you can try it again

Jul 06 '22 15:07 YohannXu

could you try adding Q/DQ after the Conv? I am not sure if we have tactics for INT8-input FP32-output 3D convs (will need to check internally).

Jul 07 '22 00:07 nvpohanh

you mean a mode like this? 2022-07-07 18-14-22 的屏幕截图 it doesn't work. 2022-07-07 18-17-26 的屏幕截图 @nvpohanh

Jul 07 '22 10:07 YohannXu

Thanks for trying. We will debug this. @zerollzeng Could you give it a try with TRT 8.2 and TRT 8.4? Let's see if it's already fixed in TRT 8.4. Thanks

Jul 07 '22 10:07 nvpohanh

@YohannXu I still met error using your export code:

import torch
from pytorch_quantization import quant_modules
import pytorch_quantization.nn as quant_nn
quant_modules.initialize()

model = nn.Conv3d(3, 16, kernel_size=(3, 3, 3), stride=(1, 1,1), padding=(1, 1, 1))
quant_nn.TensorQuantizer.use_fb_fake_quant = True
torch.onnx.export(model, torch.randn(1, 3, 20, 20, 20), 'conv3d_quant.onnx', opset_version=13)

$ python3 2125.py
Traceback (most recent call last):
  File "2125.py", line 6, in <module>
    model = nn.Conv3d(3, 16, kernel_size=(3, 3, 3), stride=(1, 1,1), padding=(1, 1, 1))
NameError: name 'nn' is not defined

Jul 07 '22 13:07 zerollzeng

Looks like they are not modified.

Jul 07 '22 13:07 zerollzeng

sry, i forgot to import torch.nn, it now works @zerollzeng

Jul 07 '22 13:07 YohannXu

I can't reproduce it in TRT 8.4, can you try 8.4?

TensorRT-8.4.1.5/bin/trtexec --onnx=conv3d_quant.onnx --int8

Jul 07 '22 13:07 zerollzeng

i tried 8.4, but the error still occur. @zerollzeng 2022-07-08 11-00-23 的屏幕截图 maybe you can try my onnx model model.zip

Jul 08 '22 03:07 YohannXu

I update the library, it now works. Thanks @zerollzeng

Jul 08 '22 03:07 YohannXu

For a simple onnx above, it now works. But for a complex onnx, the error occur again at the first conv3d op. model.zip @zerollzeng

Jul 08 '22 07:07 YohannXu

TensorRT TensorRT copied to clipboard

Convert a quantized conv3d model failed

Description

Environment

Relevant Files

Steps To Reproduce

TensorRT
TensorRT copied to clipboard