Zero Zeng comments

Results 571 comments of


                                            Zero Zeng

binary vulnerability analysis of nvinfer.dll in TensorRT 8.6

Correct.

BF16 is slower than fp16 of TensorRT 9.1 when running my R50 model on A800 GPU

@nvpohanh I guess it's expected since we have more optimized kernel for FP16, am I right?

BF16 is slower than fp16 of TensorRT 9.1 when running my R50 model on A800 GPU

> I found that even BF16 flag is set, the chosen kernels for convolution are still in FP32 precision. Per @nvpohanh 's comment, maybe the FP32 conv kernels are faster...

FP16 failure of TensorRT 8.6.1.6 when running GroundingDINO on GPU GeForce RTX 3080 Ti

@nvpohanh Will inset explicit cast works here?

FP16 failure of TensorRT 8.6.1.6 when running GroundingDINO on GPU GeForce RTX 3080 Ti

Anyone can provide a step to reproduce? Thanks!

FP16 failure of TensorRT 8.6.1.6 when running GroundingDINO on GPU GeForce RTX 3080 Ti

Requested access.

About BatchSize failure of TensorRT-8.0.1.6 when running trtexec tool on NVIDIA Jetson Xavier NX

Use `/usr/src/tensorrt/bin/trtexec --loadEngine=xx.engine --shapes=input:40x3x224x224`, because you are using explicit shape.

run resnet50 trt in python multiprocessing is error

Does the above code work if you don't use mp? looks more like a usage issue to me.

run resnet50 trt in python multiprocessing is error

> The above code alse work without mp. So there is no problem if you don't use mp. Could you please try don't use mp package but open several terminal...

Tensorrt fp32 inference slower than pytorch on tesla T4 for GroundingDINO

@nvpohanh Is this expected? (torch 650ms vs trt 590.308 ms)