LLaVA-NeXT
What should the prompt format be if I want to input an image and a video at the same time?
Thanks for open-sourcing this powerful model. The examples show how to generate prompts for text-image and video-image input. But if I want to input an image, a video, and text at the same time, how should I generate the prompt properly? Thanks.
I want to know, too.