Yet-Another-EfficientDet-Pytorch icon indicating copy to clipboard operation
Yet-Another-EfficientDet-Pytorch copied to clipboard

problem with efficientdet_test.py?

Open Jinjicheng opened this issue 4 years ago • 4 comments

I run efficientdet_test.py with d0, but the result is different with your in test folder.,there are a lot of missed objects. And the fps is only 18 with Tesla V100.

Jinjicheng avatar May 06 '20 07:05 Jinjicheng

set force_input_size=1920. Official repo uses original image size while this repo uses default network input size. If you try to compare these two repos, you must make sure the input size is consistent.

And no idea why V100 is so slow. Please provide the minimal code to replicate the speed.

zylo117 avatar May 06 '20 08:05 zylo117

I can confirm the following speeds using a V100, using the provided efficientdet_test.py script in the repository, with some different configurations

FPS Model Input size
21.3 d0 default
17.1 d0 1920
4.9 d7 default
3.3 d7 1920

This has produced the following outputs

D0 using force_input_size=1920

img_inferred_d0_this_repo_0

D7 using force_input_size=1920

img_inferred_d7_this_repo_0

The smaller d0 network seems more inclined to predicting people over motorcycles, whereas the larger d7 model seems to favor motorcycles. D0 also contains more false positives than the larger d7 model, but has more objects overall as well. The d0 image seems to match that which is provided in the repository.

mikkeldenker avatar May 06 '20 16:05 mikkeldenker

Still no idea why V100 is so slow. Maybe it's torch version? Mine is torch1.4.

Testing on 1920 is a mistake. But since official author did it. I did it too. You should test the networks at their default resolution. That's only fair.

zylo117 avatar May 07 '20 01:05 zylo117

Still no idea why V100 is so slow. Maybe it's torch version? Mine is torch1.4.

Testing on 1920 is a mistake. But since official author did it. I did it too. You should test the networks at their default resolution. That's only fair.

I use RTX 3080, force_input_size=False, using the the provided efficientdet_test.py script in the repository , the speed is 15 fps.(63ms). The most time is spend in the backbonenet. I don't know how to get you offer the speed.

bloom1123 avatar Dec 02 '21 12:12 bloom1123