LLaMA-VID: A question about the image token

Open Syloveslife opened this issue 4 months ago • 0 comments

Hi authors, I would like to ask whether the image token is inserted into every question during multi-turn dialogue training. If so, what is the purpose of doing this? Is it to improve performance?

Sep 30 '24 06:09 Syloveslife
