LLaVA-NeXT icon indicating copy to clipboard operation
LLaVA-NeXT copied to clipboard

Results 315 LLaVA-NeXT issues
Sort by recently updated
recently updated
newest added

I looked at the model card introduction but didn't see what the main differences are between these two models. Could the author explain?

if `image_idx not in video_idx_in_batch`, `image_feature` will be added into `new_image_features` repeatedly, which should be avoided. That's what this PR does.

For the llava-onevision model, the official video inference code does not modify the `image_aspect_ratio` parameter, resulting in the use of the default `anyres_max_9`. This causes the `image_features` to occupy a...

For the first version of the llava-next-video project, the model chosen was LLaVA-NeXT-Video-7B-DPO. If the number of frames is set to 32, the final inputs_embeds dimension sent to the llama2...

I am attempting to run the `finetune_onevision.sh` script. I've gotten many things sorted out but I am stumped by the `--pretrain_mm_mlp_adapter` argument. The default value as provided in the script...

Hi, thank you very much for you research, it is very interesting! I am interested in running the LLaVA-Next Interleave model but the file playground/demo/interleave_demo.py is missing Can i find...

Hi! I wonder know whether you have the plan to release the checkpoint of LLaVA-NeXT-Video stage 1, the pretrain version. I want to finetune the model from this stage!

Hi~,I am recently trying to use the llava_onevision model, I try to follow the onevision tutorial, which seems pretty easy. I run the program exactly as the tutorial, the model...