ViP-LLaVA icon indicating copy to clipboard operation
ViP-LLaVA copied to clipboard

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

Results 20 ViP-LLaVA issues
Sort by recently updated
recently updated
newest added

### Question First of all, great work and thank you so much for open-source it! I wonder if the stage 2 model(referred as ViP-LLaVA-Base) has been released anywhere? Maybe mucai/vip-llava-13b-pretrain?...

### RuntimeError: The expanded size of the tensor (628) must match the existing size (129) at non-singleton dimension 3. Target sizes: [1, 32, 1, 628]. Tensor sizes: [1, 1, 1,...

### Describe the issue Hi there, thank you for sharing this awesome project. I have one question about the requried packages for multi-image and multi-prompt generation. In the following link,...

### Discussion Thanks for your work, but the demo is hard to use, especially the module of 'Upload images'. Is there any solution to upload my own images successfully?

### Question Hi, thanks for your contribution. Could you please provide the test set link or converted file used in docs/Evaluation.md?

So that it can not be easily missleading model using llava original dataset. Meanwhile, it looks like the images are missing.. "id": "vcr-52941", "image": "vcr1images/lsmdc_3034_IDES_OF_MARCH/[email protected]", "meta_dir": "./dataset/vcr1images/lsmdc_3034_IDES_OF_MARCH/[email protected]", "class_names": [ "person",

### Question Hello @mu-cai , Thanks for your excellent work. How can we get the VIP-Bench?

### Question can't use the demo

### Discussion Hi, I have one concern about the model taking in the conversation. One conversation might have multiple questions and answers and it seems (from my understanding) the model...

### Describe the issue Issue: pip install different version with your version which mention in [issue#24](https://github.com/WisconsinAIVision/ViP-LLaVA/issues/24#issuecomment-2256470278) Command: ``` git clone https://github.com/WisconsinAIVision/ViP-LLaVA.git conda create -n vip-llava python=3.10 -y conda activate vip-llava...