Running time
I am using a Jetson AGX Orin 64G in MAXN power mode. However, when I tested the examples you provided, it took 50ms to process one image. Are the six images processed in parallel? If not, how do you achieve an output of up to 28fps?
- Just to correct you, we only achieve 25fps, not 28fps.
- Inputs include an image tensor (1x6x3x256x704) and a lidar points tensor.
- We test on Drive ORIN 64G only.
Could you share your time report with me?
Thanks
@hopef So my question is whether you process the six images using a parallel method or a serial method. If you use a parallel method, then the processing time is 40ms, right?
Yes, we only use a parallel method to process 6 images.
Because they can be packed into a batch and fed into TRTEngine.
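To make the numbers in this thread concrete, here is a minimal sketch in plain Python of how frame rate relates to per-frame latency and how large the batched camera input is. The shape 1x6x3x256x704 is the one quoted earlier in this thread; the helper names are hypothetical and not part of the repository.

```python
from math import prod

# Shape of the batched camera input quoted in this thread:
# (batch, cameras, channels, height, width)
IMAGE_TENSOR_SHAPE = (1, 6, 3, 256, 704)


def fps_to_latency_ms(fps: float) -> float:
    """Time budget per frame (all six views in one batch) for a given frame rate."""
    return 1000.0 / fps


def latency_ms_to_fps(latency_ms: float) -> float:
    """Frame rate when one batched frame takes latency_ms end to end."""
    return 1000.0 / latency_ms


num_cameras = IMAGE_TENSOR_SHAPE[1]
elements_per_view = prod(IMAGE_TENSOR_SHAPE[2:])  # 3 * 256 * 704
total_elements = prod(IMAGE_TENSOR_SHAPE)         # all six views together

print(f"{num_cameras} views, {elements_per_view} values each, {total_elements} in total")
print(f"25 fps corresponds to {fps_to_latency_ms(25):.1f} ms per frame")
print(f"50 ms per frame corresponds to {latency_ms_to_fps(50):.1f} fps")
```

If the 50ms figure covers the whole six-view batch rather than a single view, it corresponds to 20fps, compared with the 40ms budget that 25fps implies.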
@hopef However, my average processing time is 50ms, and my power consumption has reached the maximum. How can I increase the frame rate?
@hopef I also want to know if there is a difference in processing speed between the Python interface and the C++ interface.
Could you provide details of your running environment, such as the TensorRT version, JetPack version, CUDA version, and the reported inference latency?
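If it helps to produce a comparable latency number, below is a rough sketch of a timing harness in plain Python. It is not the project's own profiler: `measure_latency_ms` and `dummy_inference` are hypothetical names, and `dummy_inference` is only a stand-in for the real forward call. For GPU inference, the timed callable has to block until the GPU work has finished, otherwise only the launch time is measured.

```python
import statistics
import time


def measure_latency_ms(run_once, warmup=10, iters=100):
    """Time a callable after a warm-up phase and summarize the results.

    run_once is assumed to return only after the inference has completed.
    """
    for _ in range(warmup):
        run_once()  # warm-up runs are not timed

    samples = []
    for _ in range(iters):
        start = time.perf_counter()
        run_once()
        samples.append((time.perf_counter() - start) * 1000.0)

    mean_ms = statistics.mean(samples)
    return {
        "mean_ms": mean_ms,
        "min_ms": min(samples),
        "max_ms": max(samples),
        "fps": 1000.0 / mean_ms,
    }


def dummy_inference():
    # Placeholder workload; replace with the real inference call.
    time.sleep(0.002)


report = measure_latency_ms(dummy_inference, warmup=2, iters=5)
print(report)
```

Reporting the mean together with the minimum and maximum makes it easier to tell a steady 50ms from occasional spikes.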
@hopef TensorRT version: 8.5.2.2, JetPack version: 5.1, CUDA version: 11.4, cuDNN: 8.6.0.166
TensorRT 8.6, CUDA 11.4, and cuDNN 8.6 may be the best choice for you.