LLaVA-NeXT Meaningless response for video demo

Open TRS07170 opened this issue 5 months ago • 0 comments

I tried to run the video_demo.sh script on my own video. I only modified the video path, without changing any other parameters:

bash scripts/video/demo/video_demo.sh lmms-lab/LLaVA-NeXT-Video-7B-DPO vicuna_v1 32 2 average no_token True /path/to/my/video.mp4

With this command, the output is empty.

If I reduce the number of sampled frames to 10 or 20, it generates at least readable responses. But if I set the number of sampled frames to 30 and run the following command:

bash scripts/video/demo/video_demo.sh lmms-lab/LLaVA-NeXT-Video-7B-DPO vicuna_v1 30 2 average no_token True /path/to/my/video.mp4

then the model starts to output meaningless responses.

I wonder how this could happen. Did I set any parameters wrong?
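One thing worth checking is whether the visual tokens alone overflow the language model's context window at the higher frame counts. The sketch below is only a back-of-the-envelope estimate, not taken from the repository: it assumes the vision encoder uses 336x336 inputs with 14x14 patches, that the fourth script argument (2) is a spatial pooling stride applied to each side of the patch grid, and that the 7B language model has a 4096-token context. None of these values are confirmed in this issue, and `visual_tokens` and `ASSUMED_CONTEXT` are hypothetical names used for illustration.

```python
import math

# Assumed context length of the 7B language model (not confirmed in this issue).
ASSUMED_CONTEXT = 4096


def visual_tokens(num_frames, image_size=336, patch_size=14, pool_stride=2):
    """Estimate visual tokens for a clip, under the assumptions stated above."""
    side = image_size // patch_size          # patches per side before pooling
    pooled = math.ceil(side / pool_stride)   # patches per side after pooling
    return num_frames * pooled * pooled


if __name__ == "__main__":
    # Frame counts mentioned in this issue.
    for frames in (10, 20, 30, 32):
        n = visual_tokens(frames)
        status = "exceeds" if n > ASSUMED_CONTEXT else "fits within"
        print(f"{frames} frames: {n} visual tokens, {status} {ASSUMED_CONTEXT}")
```

If these assumptions hold, 10 and 20 frames stay under the assumed limit while 30 and 32 frames exceed it before the text prompt is even counted, which would be consistent with the behavior described above. The actual values should be read from the checkpoint's config before drawing any conclusion.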

Jun 23 '25 03:06 TRS07170
