LLaVA-NeXT
Thank you for the kind release! But when I looked at the annotations of M4-Instruct, the very first sample quite confused me. Here is the snapshot: The human and...
Following the SGLang instructions in the README.md:
```
~/sglang$ bash examples/usage/llava_video/srt_example_llava_v.sh K 0 /root/sglang/examples/usage/llava_video/videos/Q98Z4OTh8RwmDonc.mp4 /root/models/LLaVA-NeXT-Video-7B-DPO 16 examples/usage/llava_video
Each video you will sample 16 frames
Number of GPUs in GPULIST: 8
...
```
I'm trying out the Qwen model in the demo code for interleave. It seems that beyond 13 images (inside one prompt) the model always outputs an empty response. Also, when...
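One way to pin down where the output turns empty is to sweep the image count. This is a minimal sketch, assuming the Hugging Face port of the interleave model (llava-hf/llava-interleave-qwen-7b-hf) and its Qwen-style prompt format rather than the repo's own demo code; the `frame_*.png` files are placeholders.

```python
import torch
from PIL import Image
from transformers import AutoProcessor, LlavaForConditionalGeneration

model_id = "llava-hf/llava-interleave-qwen-7b-hf"  # assumed HF port, not the repo demo
processor = AutoProcessor.from_pretrained(model_id)
model = LlavaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

# Placeholder image files; substitute your own interleaved inputs.
frames = [Image.open(f"frame_{i}.png") for i in range(16)]

for n in range(1, len(frames) + 1):
    # One <image> token per image, in the Qwen chat format the HF port expects.
    prompt = (
        "<|im_start|>user " + "<image>\n" * n
        + "Describe what these images have in common.<|im_end|>"
        + "<|im_start|>assistant"
    )
    inputs = processor(images=frames[:n], text=prompt, return_tensors="pt").to(
        model.device, torch.float16
    )
    out = model.generate(**inputs, max_new_tokens=64, do_sample=False)
    text = processor.decode(
        out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
    )
    print(f"{n:2d} images ->", "<EMPTY>" if not text.strip() else text[:60])
```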
Hello, great work! I have a question I'd like to ask: what prompts were used when evaluating MMMU and MathVista? I'd appreciate it if you could provide information on the...
There is no multi-image data in LLaVA-OneVision-Data (https://huggingface.co/datasets/lmms-lab/LLaVA-OneVision-Data). Is the data complete?
Hi @ZhangYuanhan-AI, thanks for the wonderful work. Just a question about the evaluation of the detailed description: I found that the GPT eval score is converted to...
Thanks for the excellent work. The LLaVA models utilize a subset of the training data from each source; could you please share some insights/techniques about sampling/filtering the original data, apart from finding high-quality...
Hi, I find the number of images and image placeholders inconsistent in some instances of the M4-Instruct data. For example, one sample has two image placeholders but four image paths, which is...
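For anyone wanting to scan the release for this kind of mismatch, here is a small checker sketch. It assumes the common LLaVA-style schema (an `image` field holding a path or list of paths, and `conversations` turns whose `value` contains `<image>` placeholders); adjust the keys if M4-Instruct's JSON differs, and the filename is hypothetical.

```python
import json

def check_placeholders(path: str) -> None:
    """Report samples whose <image> placeholder count != number of image paths."""
    with open(path) as f:
        samples = json.load(f)
    for i, sample in enumerate(samples):
        images = sample.get("image", [])
        if isinstance(images, str):  # single-image samples may store a bare string
            images = [images]
        n_placeholders = sum(
            turn.get("value", "").count("<image>")
            for turn in sample.get("conversations", [])
        )
        if n_placeholders != len(images):
            print(f"sample {i} (id={sample.get('id')}): "
                  f"{n_placeholders} placeholders vs {len(images)} image paths")

check_placeholders("m4_instruct_annotations.json")  # hypothetical filename
```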
Great project, I appreciate it highly :) To give something back (not much, but it may help some beginners get started), here is my code for using interleave without Gradio, implemented...
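The poster's script itself is truncated above, so as a stand-in, here is a minimal sketch in the same spirit: plain interleaved inference through the repo's own loaders, without Gradio. It assumes the lmms-lab/llava-next-interleave-qwen-7b checkpoint, the `qwen_1_5` conversation template, and placeholder image paths; the structure follows the repo's documented single-image example.

```python
import copy
import torch
from PIL import Image
from llava.model.builder import load_pretrained_model
from llava.mm_utils import process_images, tokenizer_image_token
from llava.constants import IMAGE_TOKEN_INDEX, DEFAULT_IMAGE_TOKEN
from llava.conversation import conv_templates

pretrained = "lmms-lab/llava-next-interleave-qwen-7b"  # assumed checkpoint
tokenizer, model, image_processor, _ = load_pretrained_model(
    pretrained, None, "llava_qwen", device_map="auto"
)
model.eval()

# Placeholder paths; any interleaved set of images works.
images = [Image.open("left.png"), Image.open("right.png")]
image_tensor = process_images(images, image_processor, model.config)
image_tensor = [t.to(dtype=torch.float16, device=model.device) for t in image_tensor]

# Build a two-image prompt with the qwen_1_5 conversation template.
conv = copy.deepcopy(conv_templates["qwen_1_5"])
question = (DEFAULT_IMAGE_TOKEN + "\n") * len(images) + "What is different between these images?"
conv.append_message(conv.roles[0], question)
conv.append_message(conv.roles[1], None)
prompt = conv.get_prompt()

input_ids = (
    tokenizer_image_token(prompt, tokenizer, IMAGE_TOKEN_INDEX, return_tensors="pt")
    .unsqueeze(0)
    .to(model.device)
)

with torch.inference_mode():
    output_ids = model.generate(
        input_ids,
        images=image_tensor,
        image_sizes=[img.size for img in images],
        do_sample=False,
        max_new_tokens=256,
    )
print(tokenizer.batch_decode(output_ids, skip_special_tokens=True)[0])
```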