LLaVA-NeXT
Didn't stage 1.5 add Chinese OCR data? Why does Chinese recognition still have no effect at all?
When I run `bash scripts/video/demo/video_demo.sh ${path to LLaVA-NeXT-Video-7B-DPO} vicuna_v1 32 2 True ${path to video}`, I get the error `Can't set vocab_size with value 32000 for LlavaConfig...`
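One possible workaround, assuming the error is raised because the script assigns `vocab_size` directly on the composite `LlavaConfig` instead of on its nested text config, is sketched below. The local path is a placeholder for the checkpoint directory, and this is an illustration, not a confirmed fix:

```python
# Minimal sketch of a possible workaround, assuming the error comes from
# setting vocab_size on the composite LlavaConfig rather than its nested
# text_config. The path below is a placeholder for your checkpoint directory.
from transformers import AutoConfig

config = AutoConfig.from_pretrained("/path/to/LLaVA-NeXT-Video-7B-DPO")

# On recent transformers releases, vocab_size for Llava-style configs lives on
# the nested text config; assigning it on the top-level config can raise the
# error shown above.
if hasattr(config, "text_config"):
    config.text_config.vocab_size = 32000
else:
    config.vocab_size = 32000

print(config)
```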
Hi LLaVA-NeXT team, will there be official llava-hf versions of the new LLaVA-NeXT (2024-05 release) models soon?
The best thing to do would be to author correct chat templates for both your repository's Hugging Face models and the official llava-hf ones. The templates should also work with the AutoProcessor...
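As a rough sketch of what that interaction could look like with an llava-hf checkpoint (the model id here is illustrative, and `apply_chat_template` on processors is assumed from recent transformers releases):

```python
# Minimal sketch, assuming the llava-hf processor ships a chat template and
# exposes apply_chat_template as in recent transformers releases. The model id
# is illustrative.
from transformers import AutoProcessor

processor = AutoProcessor.from_pretrained("llava-hf/llava-v1.6-vicuna-7b-hf")

conversation = [
    {
        "role": "user",
        "content": [
            {"type": "image"},
            {"type": "text", "text": "What is shown in this image?"},
        ],
    }
]

# If the stored chat template is correct, this yields the exact prompt string
# the model expects, with no hand-written formatting needed.
prompt = processor.apply_chat_template(conversation, add_generation_prompt=True)
print(prompt)
```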
The output of `output_ids` is `tensor([[1, 2]], device='cuda:0')`. The other output of the demo script is: Question: A chat between a curious user and an artificial intelligence assistant. The assistant gives...
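For reference, token ids 1 and 2 are the Llama BOS/EOS tokens used by Vicuna, so `tensor([[1, 2]])` suggests the model generated no actual text. A small check, assuming `tokenizer` and `output_ids` are the demo script's existing locals:

```python
# Quick debugging sketch: decode the returned ids to confirm the generation is
# degenerate. The names tokenizer and output_ids are assumed to be the demo
# script's existing locals.
decoded = tokenizer.batch_decode(output_ids, skip_special_tokens=True)
print("raw ids:", output_ids)   # tensor([[1, 2]]) is just BOS followed by EOS
print("decoded:", decoded)      # [''] here means no text was generated
```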
Does this mean that you trained all the visual encoders during fine-tuning? If so, what are the specific training settings?
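For context, making the vision encoder trainable during fine-tuning would look roughly like the sketch below; the `get_model().get_vision_tower()` accessor follows the LLaVA codebase convention, but this is an illustration, not the authors' confirmed recipe:

```python
# Sketch of unfreezing the vision encoder for fine-tuning. The accessor names
# follow the LLaVA codebase convention but are assumptions here.
def set_vision_tower_trainable(model, trainable: bool = True) -> int:
    vision_tower = model.get_model().get_vision_tower()
    for param in vision_tower.parameters():
        param.requires_grad = trainable
    # Return the number of trainable parameters as a quick sanity check.
    return sum(p.numel() for p in vision_tower.parameters() if p.requires_grad)
```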
Greetings. I noticed that you have released an S^2-finetuned LLaVA-NeXT checkpoint, but I cannot find any description or benchmarks for it in this repo. Is there any description / benchmark...
We use LLaVA-NeXT-Video-DPO (34B).