LLaVA-NeXT icon indicating copy to clipboard operation
LLaVA-NeXT copied to clipboard

Results 344 LLaVA-NeXT issues
Sort by recently updated
recently updated
newest added

Hello, We'd like to begin by expressing our sincere appreciation for your team's excellent work on LLaVA-OneVision and for making this powerful model publicly available. It is a fantastic contribution...

for vision_model.post_layernorm.weight: copying from a non-meta parameter in the checkpoint to a meta parameter in the current model, which is a no-op. (Did you mean to pass `assign=True` to assign...

I'm trying to train a model but in `llava/train/llava_trainer.py` file. It has broken imports everywhere. I follow the installation in the Readme.md > conda create -n llava python=3.10 -y >...

### Summary Resolve import errors encountered when running `scripts/train/pretrain_clip.sh` by adding the required imports and applying minor formatting updates in `llava/train/llava_trainer.py`. No functional changes intended beyond unblocking the training run....

The code provides two arguments: 1) mm_vision_tower_lr, and 2) mm_projector_lr to set the learning rate externally. However, this does not take effect. I speculate the reason is that the optimizer...

I am trying to extract attention weights from the model and thus need to use `eager` implementation. The following code works; ```python # pip install git+https://github.com/LLaVA-VL/LLaVA-NeXT.git from llava.model.builder import load_pretrained_model...

I encounter a very weird bug when I run LLaVa_OneVision_Tutorial.ipynb ../aten/src/ATen/native/cuda/IndexKernel.cu:92: operator(): block: [0,0,0], thread: [63,0,0] Assertion `-sizes[i]

Hi @taesiri 🤗 I'm Niels and work as part of the open-source team at Hugging Face. I discovered your work through Hugging Face's daily papers as yours got featured: https://huggingface.co/papers/2509.00676....