LLaVA-NeXT video caption often contains " The image "

video caption often contains " The image "

Open valencebond opened this issue 1 year ago • 3 comments

we use the LLaVA-NeXT-Video-DPO (34B)

May 11 '24 07:05 valencebond

I don't know if it's the same with video, but with previous model and images you could just provide a system prompt, for example I may use something like

prompt = "Describe the image in search engine keyword tags"
prompt_format = "[INST] SYSTEM: You are a professional image captioner, describe images as reduced keyword tags for search engines separated by commas.\nUSER: <image>\n<prompt>[/INST]"

May 11 '24 21:05 WAS-PlaiLabs

I couldn't run the model. (README.md)

May 17 '24 07:05 csm12321

#43

Jun 02 '24 14:06 ZhangYuanhan-AI

LLaVA-NeXT LLaVA-NeXT copied to clipboard

video caption often contains " The image "

LLaVA-NeXT
LLaVA-NeXT copied to clipboard