LLaVA-NeXT issues

llama3 chat format error in the example

1

https://github.com/LLaVA-VL/LLaVA-NeXT/blob/inference/docs/LLaVA-NeXT.md In this example, your code generate double "" in front of "user" for the prompt_question variable. Could you check if there is any mistake in your code. Below is...

y-rok

Code of higher-AnyRes

As describeed in blog-2024-05-25(https://llava-vl.github.io/blog/2024-05-25-llava-next-ablations/), higher-AnyRes is proposed to avoid the loss of detail for high-resolution images. Where can I find the **code of** **higher-AnyRes image dividing method** and the **thresholded...

Super-Shen

ValueError: Unable to create tensor, you should probably activate padding with 'padding=True' to have batched tensors with the same length.

1

code：video = image_processor.preprocess(video, return_tensors="pt")["pixel_values"].half().cuda()

XHB-ZMM

training code

6

Hello, I am trying to find the training code, but it seems like there is just inference code. Can you please point to the training code?

ehartford

Update LLaVA-NeXT.md - Typo (missing letter)

TayyibChohan

Actual Demo for Interleave

2

Gradio is fine for playing around but can you please add proper demo code like you've done for the other llava-next models?

Jchang4

Make some ad-hoc changes to use the interleave model

Hi, team. Thank you for your great work. I made some ad-hoc changes to use the interleave model. Please let me know if I need to change something.

sj-h4

About llava-next-interleave inference

2

`bash playground/demo/interleave_demo.py --model_path path/to/ckpt` The execute code should be run with python not bash. And How can this code specify the input image sequence? It appears to be just a...

adkAurora

Question about AnyRes calculations: bug or intended?

1

Hello LLaVa-NeXT team! I want to clarify some points about the AnyRes technique and how the image feature is unpadded in modeling forward. As this [issue](https://github.com/huggingface/transformers/issues/31327) shows, seems like a...

zucchini-nlp

Training/Finetunning code please

5

Hi, Dear author: It seems the llava-next is really insightful exploreing work. Please kindly release the training and inference code asap, thank you very much.

dragen1860

LLaVA-NeXT
LLaVA-NeXT copied to clipboard

Metadata

llama3 chat format error in the example

Code of higher-AnyRes

ValueError: Unable to create tensor, you should probably activate padding with 'padding=True' to have batched tensors with the same length.

training code

Update LLaVA-NeXT.md - Typo (missing letter)

Actual Demo for Interleave

Make some ad-hoc changes to use the interleave model

About llava-next-interleave inference

Question about AnyRes calculations: bug or intended?

Training/Finetunning code please

← Metadata

Owner

Metadata

LLaVA-NeXT LLaVA-NeXT copied to clipboard

Metadata

← Metadata

Owner

Metadata

LLaVA-NeXT
LLaVA-NeXT copied to clipboard