GroundingDINO RandomResize during inference?

RandomResize during inference?

Open eugeneteoh opened this issue 1 year ago • 5 comments

Quick question. Why do you do RandomResize during inference?

https://github.com/IDEA-Research/GroundingDINO/blob/2b62f419c292ca9c518daae55512fabc3fead4a4/demo/inference_on_a_image.py#L64

Feb 01 '24 20:02 eugeneteoh

Quick question. Why do you do RandomResize during inference?

https://github.com/IDEA-Research/GroundingDINO/blob/2b62f419c292ca9c518daae55512fabc3fead4a4/demo/inference_on_a_image.py#L64

We can only set one scale in its args to make it into a single size transform here

Feb 02 '24 18:02 rentainhe

Tracking down RandomResize implementation, the images are resized to a fixed square size of 800x800; no randomness is employed.

Note that it is [800] not 800, the former is a list of random sizes to choose from, whereas the later is a minimum limit on the sizes to use. Since we choose a random number from a list of one value only, the size is always fixed to 800.

Aug 06 '24 23:08 m-hasan-n

I tried feeding images a few times and found that the images were resized while keeping the original ratio. The short side of the image will be resized to 800 if doing so the long side of the resized image will be less than or equal to 1333. Otherwise, the long side will be resized to 1333.

Sep 04 '24 08:09 shojint

GroundingDINO GroundingDINO copied to clipboard

RandomResize during inference?

GroundingDINO
GroundingDINO copied to clipboard