Grounded-Segment-Anything Is the anyway that I could detect everything that the model knew in the picture?

Is the anyway that I could detect everything that the model knew in the picture?

Open aixiaodewugege opened this issue 2 years ago • 3 comments

Like the usual model can do like yolov5, to detect everything.

May 05 '23 05:05 aixiaodewugege

Like the usual model can do like yolov5, to detect everything.

In theory, model can detect every language input, for the common case like 80 categories in COCO, we evaluate GroundingDINO by concat all the category name with ., you may input the language prompt as:

person. cat. dog. ...

And see if the model can detect them correctly, you should also be careful with the box threshold and text threshold, which may influence the output results

May 05 '23 06:05 rentainhe

Thanks for your reply. How does the text threshold work? What will be effected by it?

May 05 '23 06:05 aixiaodewugege

Hi @aixiaodewugege , Hope you have already found the answer. For others, here is the helpful link for input/output https://github.com/IDEA-Research/GroundingDINO#star-explanationstips-for-grounding-dino-inputs-and-outputs

Dec 22 '23 14:12 abhishek0093

Grounded-Segment-Anything Grounded-Segment-Anything copied to clipboard

Is the anyway that I could detect everything that the model knew in the picture?

Grounded-Segment-Anything
Grounded-Segment-Anything copied to clipboard