LLaVA-Align icon indicating copy to clipboard operation
LLaVA-Align copied to clipboard

The same model_kwargs and model_kwargs_cd.

Open Stevetich opened this issue 1 year ago • 1 comments

Hi. I have noticed that you adopted the code of VCD as your base code. But I found that they use the same model_kwargs and model_kwargs_cd to generate tokens. I am confused because past_key_values term is also incorporated in model_kwargs, which means the same past_key_values term is used in original and distorted images as the visual inputs. Is that operation correct?

Stevetich avatar May 23 '24 05:05 Stevetich

Thanks for pointing that out! I'm not entirely clear on the issue you're describing, though. Can you explain the problem a bit more? I'd really appreciate it if you could help me understand why this operation might be incorrect.

yfzhang114 avatar May 23 '24 09:05 yfzhang114