LFD-A-Light-and-Fast-Detector why trt mode consumed more GPU and MEM space？

why trt mode consumed more GPU and MEM space？

Open chainfitness opened this issue 2 years ago • 2 comments

thx this awesome work! I just want to deploy this awesome work in practice, but i find that trt mode consume more GPU and MEM space.The GPU refered above is Tesla T4, i'd appreciate if the prompt repley.

Mar 02 '22 02:03 chainfitness

@chainfitness We do not encounter your case before. Would you provide more information?

Mar 03 '22 11:03 YonghaoHe

I'm obilged for quikly reply.I want to deploy face detection from this work to nvidia tx platform. But the official installing TensorRT version from TX is 8.0.1.6,which is too new to run predict_tensorrt.py. So,i move code to Tesla T4 that installed TensorRT 7.2.2.3, it works! But i find that trt mode consumed more GPU and MEM space, GPU and MEM consumption running predict.py was 977M , 2123M respectively,however, running predict_tensorrt.py was 1486M and 3714M. The pretrained weight was downloaded from predict_tensorrt.py is only modified the path of pretrained weight.

Mar 04 '22 02:03 chainfitness

LFD-A-Light-and-Fast-Detector LFD-A-Light-and-Fast-Detector copied to clipboard

why trt mode consumed more GPU and MEM space？

LFD-A-Light-and-Fast-Detector
LFD-A-Light-and-Fast-Detector copied to clipboard