Grounded-Segment-Anything icon indicating copy to clipboard operation
Grounded-Segment-Anything copied to clipboard

multiple object inference

Open mhyeonsoo opened this issue 1 year ago • 2 comments

Hi, thanks for the great implementations. I am trying to run this code with the multi-object prompt. As you guided, I used ' . ' between each classes, but could get a bit weird results. I have two questions with my one sample output image.

image

  1. My prompt is "asparagus . carrot . zucchini ." Is there some mistakes with this prompt? I am wondering why the model predicts all the asparagus as zucchini, and I don't think this is only because of the DINO performance issue.
  2. Is there a case that we can see broken class output such as '##ni' on the image?

Thanks again!

mhyeonsoo avatar Apr 18 '23 02:04 mhyeonsoo

Same question here. I tried "cat . dog ." on asset/demo6.jpg, but I got wierd output phrase "cat dog". I also tried "cat,dog", it does return cat or dog. But the number of detected objects decreases and some objects are misclassified.

tdzdog avatar Apr 18 '23 14:04 tdzdog

Has anyone solved the issue? Not yet solved from my side yet.

mhyeonsoo avatar May 02 '23 02:05 mhyeonsoo

Has anyone solved the issue? Not yet solved from my side yet.

I also face this problem. Another question for me: I can get correct segmentation with single prompt (e.g. cat ), however I cannot segment out this object with multi-object prompt (e.g. like cat . dog .) Have you noticed this problem?

longyangqi avatar May 22 '23 06:05 longyangqi

@longyangqi yeah, that issue is just what I am facing recently. Hope the authors are taking care of it.

mhyeonsoo avatar May 24 '23 00:05 mhyeonsoo

@mhyeonsoo The "##ni" is caused by the tokenizer (something like splitting zucchini into "zucchi" + "##ni") you can have a look at https://github.com/IDEA-Research/Grounded-Segment-Anything/blob/main/GroundingDINO/groundingdino/util/inference.py

longyangqi avatar May 26 '23 06:05 longyangqi