LLaVA-NeXT issues

Where to find multi-image training data?

1

Hi, I can only find a subset of the image from the dataset in [huggingface](https://huggingface.co/datasets/lmms-lab/LLaVA-OneVision-Data) And I don't find multi-image datas in the dataset. for example, tqa is a multi-image...

jyC23333

LLaVA-OneVision数据问题

3

想问下论文中提到的LLaVA-158K数据有开源么 https://huggingface.co/datasets/lmms-lab/LLaVA-OneVision-Data 这里好像没找到这个数据

will-wiki

(Community Chatting Group)建一个微信交流群，这样大家有问题可以实时讨论

1

不是引流，只是考虑到可能大家会有些不构成 issue 的小问题，有个群会比较好。后续如果官方有需要，我愿意转让群管理我的微信 dreamingforhope ，若二维码失效可添加我 ![image](https://github.com/user-attachments/assets/ac21772d-2a9b-42bf-bc43-3a842f36cf6a)

chmod777john

Inferencing Finetuned model

3

Hello, I finetuned LLaVA onevision with Qwen2-7B. In the finetuning script, I set it to finetune just the adapter. When I am trying to inference my model, I am using...

rfoxes

[BUG] Function `prepare_inputs_labels_for_multimodal` flattens batch data

2

In the file `llava/model/llava_arch.py` under the class `LlavaMetaForCausalLM` there is a function`prepare_inputs_labels_for_multimodal` that is called when calling the `generate` and `forward` functions. In lines 411 and 412, the input embeds...

guyazran

(Community Chatting Group)建一个微信交流群，这样大家有问题可以实时讨论

1

不是引流，只是考虑到可能大家会有些不构成 issue 的小问题，有个群会比较好。后续如果官方有需要，我愿意转让群管理我的微信 dreamingforhope ，若二维码失效可添加我 ![image](https://github.com/user-attachments/assets/b3af120b-9eda-44eb-a9fc-ab8f0d4b3990)

chmod777john

Too much "I’m sorry" gpt answers in M4-Instruct-Data

1

Have the authors cleaned the datasets? ``` [{'from': 'gpt', 'value': 'Help me write a Twitter post considering the following images.\n'}, {'from': 'human', 'value': "I'm sorry, I can't assist with that...

cxmscb

The OneVision-7B Demo is unable to return a result.[bug]

1

There are some issues with the online OneVision-7B Demo. When two images are input for inference, it fails to return a result. @Luodian @ZhangYuanhan-AI @Luodian ![1](https://github.com/user-attachments/assets/984194ba-a09b-47cc-ad1c-68cf8429d491)

AmazDeng

AttributeError: 'PreTrainedTokenizerFast' object has no attribute 'legacy'

1

I am using a pretrain adapter with deepspeed --pretrain_mm_mlp_adapter /home/srikanth/api-webapp/checkpoints/llava-v1.5-llama-3-8b-pretrain/mm_projector.bin but this throws an error "AttributeError: 'PreTrainedTokenizerFast' object has no attribute 'legacy'" The pretrained adapter was not created with the...

SrikanthChellappa