VisionLLM icon indicating copy to clipboard operation
VisionLLM copied to clipboard

How to get all object super link of $$H_{link}$$ in last-layer hidden states during testing ?

Open zc-zhao opened this issue 1 year ago • 1 comments

I have read issue 11, but still have some questions. During testing, you utilize the forward function for testing

https://github.com/OpenGVLab/VisionLLM/blob/028f8b3d8a8f7d41edef5c0046e8bc49e25e7144/VisionLLMv2/visionllmv2/eval/eval_det.py#L119

But as far as I know, the forward function cannot perform the next token generation, resulting in struggling to output answer and [DET] token. Why don't you use the generate function? Is it possible that the inputs ids in your test contain the answer?

When will you make the training and testing datasets and dataloader public? https://github.com/OpenGVLab/VisionLLM/blob/028f8b3d8a8f7d41edef5c0046e8bc49e25e7144/VisionLLMv2/visionllmv2/eval/eval_det.py#L111

zc-zhao avatar Nov 19 '24 04:11 zc-zhao

I have read issue 11, but still have some questions. During testing, you utilize the forward function for testing

https://github.com/OpenGVLab/VisionLLM/blob/028f8b3d8a8f7d41edef5c0046e8bc49e25e7144/VisionLLMv2/visionllmv2/eval/eval_det.py#L119

But as far as I know, the forward function cannot perform the next token generation, resulting in struggling to output answer and [DET] token. Why don't you use the generate function? Is it possible that the inputs ids in your test contain the answer?

When will you make the training and testing datasets and dataloader public?

https://github.com/OpenGVLab/VisionLLM/blob/028f8b3d8a8f7d41edef5c0046e8bc49e25e7144/VisionLLMv2/visionllmv2/eval/eval_det.py#L111

I have the same question, do you have any new insights?

josephzpng avatar Jan 08 '25 07:01 josephzpng