LLaVA-NeXT
Why is the plain prompt version used in the pretraining stage?
Hi,
Is the prompt version in the pre-training stage of OneVision (see https://github.com/LLaVA-VL/LLaVA-NeXT/blob/main/scripts/train/pretrain_clip.sh) set to plain on purpose? Should it not be qwen_2? If it is intentional, could you explain why you decided not to use a prompt template in the pre-training stage?