CDN icon indicating copy to clipboard operation
CDN copied to clipboard

questions about `args.use_matching`

Open Dawn-LX opened this issue 2 years ago • 5 comments

Thank you for the nice work of HOI. I'm currently following your work

I have a question about the args.use_matching. It seems that you did not use the matching_embed when training the model. Is that true all the reported model were trained with args.use_matching=False ?

Dawn-LX avatar Dec 20 '22 12:12 Dawn-LX

I also have anothr question about the matching_embed or the interactive_score. In HICO-DET, it seems that the "interactiveness" is not compatible with the verb classifier when the classifer is set as 117-d. because there is a "no_interaction" class as one of the 117 verb classes.

I wonder if someone try a 116-d classifier and then use the "interactiveness" classifier (i.e., a 2-d classifier) as supplement.

In addition, comapred to HOI, another similar task scene graph generation (SGG) does not exist a relation class named "no_interaction", (e.g.,the VisualGenome dataset). And therefore SGG does not use mAP as metric.

I wonder if it is make sense to take account into the "no_interaction" as one of the relation classes and calculate mAP as metric. After all, enumerating and annotating all possible relations between subject-object pair is usually impossible. And annotating all no_interaction sub-obj pair is also impossible

Dawn-LX avatar Dec 20 '22 12:12 Dawn-LX

In addition, according to the dataloader, here: https://github.com/YueLiao/CDN/blob/71c570296ecc7194c7abf99f5207344aa9c8dabe/datasets/hico.py#L120

The matching_labels are all ones, regardless of the verb label being "no_interaction" or not.

Dawn-LX avatar Dec 21 '22 05:12 Dawn-LX

Thank you for the nice work of HOI. I'm currently following your work

I have a question about the args.use_matching. It seems that you did not use the matching_embed when training the model. Is that true all the reported model were trained with args.use_matching=False ?

Yes, all models were trained with args.use_matching=False

YueLiao avatar Dec 21 '22 09:12 YueLiao

I also have anothr question about the matching_embed or the interactive_score. In HICO-DET, it seems that the "interactiveness" is not compatible with the verb classifier when the classifer is set as 117-d. because there is a "no_interaction" class as one of the 117 verb classes.

I wonder if someone try a 116-d classifier and then use the "interactiveness" classifier (i.e., a 2-d classifier) as supplement.

In addition, comapred to HOI, another similar task scene graph generation (SGG) does not exist a relation class named "no_interaction", (e.g.,the VisualGenome dataset). And therefore SGG does not use mAP as metric.

I wonder if it is make sense to take account into the "no_interaction" as one of the relation classes and calculate mAP as metric. After all, enumerating and annotating all possible relations between subject-object pair is usually impossible. And annotating all no_interaction sub-obj pair is also impossible

The HICO-DET only annotated very few no-interactive human-object pairs with a 'no-interaction' label in an image. Accurately, we directly regard the 'no-interaction' label as a common 'interactive' HOI label and didn't treat the 'no-interaction' label as a negative tag. Here, we define the human-object pairs without any annotation (including 'no-interaction') as the no-interactive pairs.

YueLiao avatar Dec 21 '22 09:12 YueLiao

The HICO-DET only annotated very few no-interactive human-object pairs with a 'no-interaction' label in an image. Accurately, we directly regard the 'no-interaction' label as a common 'interactive' HOI label and didn't treat the 'no-interaction' label as a negative tag. Here, we define the human-object pairs without any annotation (including 'no-interaction') as the no-interactive pairs.

Okay, I see. Thank you for your answer!

Dawn-LX avatar Dec 22 '22 07:12 Dawn-LX