Lidar_AI_Solution icon indicating copy to clipboard operation
Lidar_AI_Solution copied to clipboard

Running time

Open super-liuyang opened this issue 2 years ago • 8 comments

I am using the Jetson AGX Orin 64G with a power consumption of maxn. However, when I tested the examples you provided, it took 50ms to process one image. Are the six images processed in parallel? If not, how do you achieve an output of up to 28fps?

super-liuyang avatar Jan 09 '24 12:01 super-liuyang

  1. Just to correct you, we only achieve 25fps, not 28fps.
  2. Inputs include an image tensor(1x6x3x256x704) and a lidar points tensor.
  3. We test on Drive ORIN 64G only.

Could you share your time report with me?

Thanks

hopef avatar Jan 11 '24 06:01 hopef

@hopef So my question is whether you process the six images using a parallel method or a serial method. If you use a parallel method, then the processing time is 40ms, right? 2024-01-06 17-56-23 的屏幕截图

super-liuyang avatar Jan 11 '24 06:01 super-liuyang

Yes, we only use a parallel method to process 6 images.

Because they can be packed into a batch and fed into TRTEngine.

hopef avatar Jan 11 '24 06:01 hopef

@hopef However, my average processing time is 50ms, and my power consumption has reached the maximum. How can I increase the frame rate?

super-liuyang avatar Jan 11 '24 06:01 super-liuyang

@hopef I also want to know if there is a difference in processing speed between the Python interface and the C++ interface.

super-liuyang avatar Jan 11 '24 06:01 super-liuyang

Could you provide your running environment details? like TensorRT Version, JetsonPackage Version, CUDA Version, and Reported inference latency.

hopef avatar Jan 11 '24 06:01 hopef

@hopef TensorRT Version:8.5.2.2, JetsonPackage Version:5.1, CUDA Version:11.4 , Cudnn:8.6.0.166

super-liuyang avatar Jan 11 '24 06:01 super-liuyang

TensorRT-8.6, cuda-11.4 and cudnn8.6 may be the best choice for you.

hopef avatar Jan 11 '24 06:01 hopef