Yuxuan Wang comments

Results 39 comments of


                                            Yuxuan Wang

Discrepancy in Image ID Alignment Between M3IT and VideoChat2IT

Hi, I didn't find `image/caption/minigpt4` from M3IT, how can I obtain these images?

Detect new object and Keep tracking of old obejcts

For SAM2, it depends on the length of the FIFO queue. For Grounded-SAM2, you can preserve the text of the old object as the input for GroundingDINO. Unfortunately, this code...

Detect new object and Keep tracking of old obejcts

Apologies for the late reply. I believe it is possible as long as the object remains within the FIFO queue.

Prompting like Grounded SAM2

I haven't implemented multiple references in your case due to my limited bandwidth. However, I believe it could be achieved by modifying the `add_new_points_or_box` function. Feel free to submit a...

Expected Latency?

This is an important problem, and I'd like to give you an exact answer. However, I don't have much time at the moment. Could you try to tackle it and...

Plans to integrate newest SAM2/Grounded-SAM-2 update

Thank you for your interest. I will provide updates if time permits and will inform you as soon as there is any progress.

Sorry for the late reply. Please forgive me, as I didn't receive any notification. The evaluation data are from VideoChatGPT and Video-LLaVA, which can be obtained directly from this [link](https://github.com/PKU-YuanGroup/Video-LLaVA/blob/main/TRAIN_AND_VALIDATE.md#data-for-validating).

Yuxuan Wang

Discrepancy in Image ID Alignment Between M3IT and VideoChat2IT

Detect new object and Keep tracking of old obejcts

Detect new object and Keep tracking of old obejcts

Prompting like Grounded SAM2

Expected Latency?

Plans to integrate newest SAM2/Grounded-SAM-2 update

No gt_file_question

About the vocabulary inconsistence

Library Environment Request

Confusion about track function in `SAM2CameraPredictor`