InternVL
InternVL copied to clipboard
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
### Motivation 根据结构来看好像是用dpo脚本训练internVL的MPO 请问如果是QwenVL的模型是不是不支持 有没有什么方法迁移过去 ### Related resources _No response_ ### Additional context _No response_
In table 13 of your paper ”Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling” shows that the **result of InternVL2_5-2B on GSM8K(4-shot)** is about **55**,...
### Motivation Many of us only have a single node with several GPUs, and it is more common to use torchrun than srun. Hopefully, there will be an official script...
### Checklist - [x] 1. I have searched related issues but cannot get the expected help. - [x] 2. The bug has not been fixed in the latest version. -...
Hi, is it possible to use Internvl2.5 to do segmentation tasks ?
在目录下没有看到MPO lora的微调,请问是目前不支持吗,还是说MPO的lora微调用的是2_5的lora微调脚本?
### Motivation Would it be possible to add a GRPO fine tuning stage to InternVL (2.5) ? I believe it would be great to teach InternVL how to reason without...
### 📚 The doc issue In the documentation "Enhancing InternVL2 on COCO Caption Using LoRA Fine-Tuning", it is written at the end to copy the config.json file from the original...
### 📚 The doc issue 在mlp1层之前,新增一个全连接层,应该怎么训练,视觉层和语言层都参与训练吗?辛苦帮忙解答下@czczup ### Suggest a potential alternative/fix _No response_
### Checklist - [x] 1. I have searched related issues but cannot get the expected help. - [ ] 2. The bug has not been fixed in the latest version....