tensorrt icon indicating copy to clipboard operation
tensorrt copied to clipboard

No significant performance improvement on UNET semantic segmentation model.

Open dhinkris opened this issue 5 years ago • 6 comments

Hi, I used TFTRT to optimize the 3D UNET segmentation. I couldn't find any significant improvement in the speed. On a P100 GPU, for a batch size of 4 averaged for 20 runs: i) keras model = 0.78s ii) TFTRT-FP32 = 0.74s iii) TFTRT-FP16 = 0.74s

Can somebody help if this can be optimized? Thank you

dhinkris avatar Nov 13 '19 21:11 dhinkris