GroundingDINO
GroundingDINO copied to clipboard
REC functionality
Hi and thanks for you're awesome work on Grounding DINO. I mostly use Grounding DINO to label different object classes, which works really nicely.
Now I tried to experiment with the models REC functionality using the Swin-T checkpoint, though Iam not able to get it working properly. I tried both the image with the different colored cats and the image with the three lions.
E.g., I use the example prompt from the paper for the lion image "The left lion". This does not work at all. And this has been the experience for almost all images and prompts.
Now Iam wondering if Iam doing something wrong?
Thanks in advance,
kind regards,
M
I met the same problem .
Try lowering your text_threshold to ~0.01