Grounded-Segment-Anything
Grounded-Segment-Anything copied to clipboard
detect objects with affordance descriptions
Hi! Thanks for your admiring work! I would like to know if there are any methods to defect objects with affordance descriptions. For example, we want to detect 'the cat on the table'. Thus, we want to detect the cat rather than the table. However, current Grouded DINO would extract all nouns in the prompts first and seemingly will extract 'cat' and 'table', respectively.
thanks for your question. groundingdino can finish this as it is a grounding model.
The model will output multiple objects. Each object has a text similarity over text tokens. You can use a text mask to filter out the boxes with highest similarities with token 'cat'.
You may need to add extra codes to finish this for now.
Thanks for your questiones. We will make the solution more user-friendly soon.
I understand. Thanks for your reply!
@SlongLiu Hi, thanks for this awesome work. I find the segmentation results of multiple object is not as accurate than the single object segmentation. May I know for any suggestions for this? Thanks.