Input shape of LLVIP YOLOV5L

Open XiongZhongxia opened this issue 2 years ago • 3 comments

Thanks for your contribution! Could you please tell me the input shape for training LLVIP YOLOV5L, which achieves 97.5 mAP@0.5 and 5.40 MR?

XiongZhongxia avatar May 10 '22 07:05 XiongZhongxia

I have the same question and look forward to an answer.

The author provides a pre-trained checkpoint named "yolov5l_transformerx3_llvip_s1024_bs32_e200". As this reply mentioned, the author uses a 1024 x 1024 image shape to train yolov5l on the LLVIP dataset.

However, this figure shows an image shape of 640 x 640 x 3, and another clue also points to 640 x 640 x 3.

Using the same image shape matters for a fair comparison, so I would like to know which one was actually used.

XueZ-phd avatar Nov 15 '22 12:11 XueZ-phd

I can answer my question now! When training YOLOV5l on LLVIP, the image shape is 1024 x 1024 x 3. Please refer to this reply.
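As a quick cross-check, the checkpoint name quoted earlier in this thread appears to encode the training settings. The sketch below assumes that `s1024`, `bs32`, and `e200` stand for image size, batch size, and epochs; that reading is only an interpretation of the name, not something the author has documented, and `parse_checkpoint_name` is a hypothetical helper, not part of the repository.

```python
import re


def parse_checkpoint_name(name: str) -> dict:
    """Pull s<size>, bs<batch>, and e<epochs> tokens out of a checkpoint name.

    The meaning assigned to each token is an assumption based on the name alone.
    """
    patterns = {
        "img_size": r"(?:^|_)s(\d+)(?:_|$)",
        "batch_size": r"(?:^|_)bs(\d+)(?:_|$)",
        "epochs": r"(?:^|_)e(\d+)(?:_|$)",
    }
    fields = {}
    for key, pattern in patterns.items():
        match = re.search(pattern, name)
        if match:
            fields[key] = int(match.group(1))
    return fields


info = parse_checkpoint_name("yolov5l_transformerx3_llvip_s1024_bs32_e200")
print(info)  # {'img_size': 1024, 'batch_size': 32, 'epochs': 200}
```

Under that assumption, the name agrees with the 1024 x 1024 training resolution rather than 640 x 640.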

XueZ-phd avatar Nov 15 '22 12:11 XueZ-phd

Thanks a lot!

XiongZhongxia avatar Nov 20 '22 01:11 XiongZhongxia