LLaVA-NeXT icon indicating copy to clipboard operation
LLaVA-NeXT copied to clipboard

what prompt format shoud be if I want to input an image and a video at the same time

Open bendanzzc opened this issue 1 year ago • 1 comments

Thanks for opening source the powfer model. the case show how to generate prompt for text-image and video-image. but If I want to input image-video-text at the same time, how to generate prompt properly. thanks

bendanzzc avatar Sep 11 '24 08:09 bendanzzc

i want to know,too

guoyanan1g avatar Sep 29 '24 08:09 guoyanan1g