LLaVA-NeXT icon indicating copy to clipboard operation
LLaVA-NeXT copied to clipboard

Meaningless response for video demo

Open TRS07170 opened this issue 5 months ago • 0 comments

I tried to run the video_demo.sh script on my own video. I only modified the video path with out changing any other parameters:

bash scripts/video/demo/video_demo.sh lmms-lab/LLaVA-NeXT-Video-7B-DPO vicuna_v1 32 2 average no_token True /path/to/my/video.mp4

Then the output becomes empty, like this: Image

If I reduce the number sampled frames to 10 or 20, it generate at least readable responses. But if I increase the number of sampled frames to 30 and run the following command:

bash scripts/video/demo/video_demo.sh lmms-lab/LLaVA-NeXT-Video-7B-DPO vicuna_v1 30 2 average no_token True /path/to/my/video.mp4

then the model starts to output meaningless responses: Image

I wonder how could this happen? Did set any parameters wrong?

TRS07170 avatar Jun 23 '25 03:06 TRS07170