STTran icon indicating copy to clipboard operation
STTran copied to clipboard

customized inputs

Open Mingyuan1997 opened this issue 2 years ago • 6 comments

Is there any suggestion on how to run the model on customized input videos?

Thank you!

Mingyuan1997 avatar Apr 19 '22 21:04 Mingyuan1997

Hi, i think it is not difficult to use the model to infer the customized videos. Line 167-178 in https://github.com/yrcong/STTran/blob/main/dataloader/action_genome.py may help you:) best

yrcong avatar Apr 20 '22 00:04 yrcong

If we test on custom video, it seems that the information like attention_relationship and bboxes are required?

xiaodanhu avatar Apr 21 '22 14:04 xiaodanhu

I think so.

Mingyuan1997 avatar Apr 22 '22 19:04 Mingyuan1997

If we test on custom video, it seems that the information like attention_relationship and bboxes are required?

Why? The attention relationships should be predicted and the bboxes should be inferred by the object detector.

yrcong avatar Apr 25 '22 20:04 yrcong

If we test on custom video, it seems that the information like attention_relationship and bboxes are required?

Why? The attention relationships should be predicted and the bboxes should be inferred by the object detector.

Thanks for replying! When I checked the dataloader in action_genome.py, the person_bbox and object_bbox were loaded from person_bbox.pkl and object_bbox_and_relationship.pkl, respectively. So I was guessing, if loading the customized video, we also need this information so that we can load it properly. Can you indicate where the model automatically predicts the relationship and bboxes? Thank you very much!

xiaodanhu avatar Apr 25 '22 20:04 xiaodanhu

If you just want to test on your customized video dataset, only the video frames (self.video_list in the class AG) are necessary (for the setting SGDET). Sometimes person_bbox and object_bbox are used in the test because there are other two settings PredCLS/SGCLS which are widely used in the image scene graph generation.

yrcong avatar Apr 29 '22 09:04 yrcong