Grounded-Segment-Anything icon indicating copy to clipboard operation
Grounded-Segment-Anything copied to clipboard

Is the anyway that I could detect everything that the model knew in the picture?

Open aixiaodewugege opened this issue 2 years ago • 3 comments

Like the usual model can do like yolov5, to detect everything.

aixiaodewugege avatar May 05 '23 05:05 aixiaodewugege

Like the usual model can do like yolov5, to detect everything.

In theory, model can detect every language input, for the common case like 80 categories in COCO, we evaluate GroundingDINO by concat all the category name with ., you may input the language prompt as:

person. cat. dog. ...

And see if the model can detect them correctly, you should also be careful with the box threshold and text threshold, which may influence the output results

rentainhe avatar May 05 '23 06:05 rentainhe

Thanks for your reply. How does the text threshold work? What will be effected by it?

aixiaodewugege avatar May 05 '23 06:05 aixiaodewugege

Hi @aixiaodewugege , Hope you have already found the answer. For others, here is the helpful link for input/output https://github.com/IDEA-Research/GroundingDINO#star-explanationstips-for-grounding-dino-inputs-and-outputs

abhishek0093 avatar Dec 22 '23 14:12 abhishek0093