onnx-tensorrt onnx2trt: No speed difference between models of different sizes.

onnx2trt: No speed difference between models of different sizes.

Open Fschoeller opened this issue 4 years ago • 3 comments

I have two yolov5 models of different sizes. One has 35.9m parameters, the other 12.7m. When I convert the models to TensorRT with trtexec --onnx=model.onnx --batch=5 --fp16 the resulting models have roughly the same inference speed (21 fps) even though the speed should be vastly different. What am I doing wrong?

Jun 24 '21 18:06 Fschoeller

What TRT version are you using? Are you able to provide the models you are benchmarking?

Jun 28 '21 18:06 kevinch-nv

I'm using TensorRT 7.1.3 on the Jetson Xavier AGX. Would you like the models as ONNX files?

Jun 28 '21 19:06 Fschoeller

Yes, proving the models in ONNX form will be useful.

Are you seeing the same performance difference with the latest version of TRT?

Jun 16 '22 19:06 kevinch-nv

onnx-tensorrt onnx-tensorrt copied to clipboard

onnx2trt: No speed difference between models of different sizes.

onnx-tensorrt
onnx-tensorrt copied to clipboard