InternVL icon indicating copy to clipboard operation
InternVL copied to clipboard

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Results 461 InternVL issues
Sort by recently updated
recently updated
newest added

版本号 vllm==0.10.0 启动命令 CUDA_VISIBLE_DEVICES=4,5,6,7 vllm serve /data/models/InternVL3_5-241B-A28B \ --tensor-parallel-size 4 \ --trust-remote-code 报错信息 INFO 09-10 10:02:12 [__init__.py:235] Automatically detected platform cuda. INFO 09-10 10:02:14 [api_server.py:1755] vLLM API server version 0.10.0...

Thank you for your excellent work—InternVL3.5! Will the dataset you used during Pre-Training and SFT phase be made public? In the technical report, you mentioned that some additional data was...

### Checklist - [ ] 1. I have searched related issues but cannot get the expected help. - [ ] 2. The bug has not been fixed in the latest...

请问 ./path/to/pretrain/data/mixture.json 这里的mixture.json是怎么生成的啊?如果需要用自己的pretrain data生成json,可否给个样例?

### Checklist - [ ] 1. I have searched related issues but cannot get the expected help. - [ ] 2. The bug has not been fixed in the latest...

我在看"InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks" 这篇论文,论文中3个阶段,都需要一个一个训练才能组成一个模型。论文中的每个阶段训练分别对应代码中的那一块?

### Checklist - [x] 1. I have searched related issues but cannot get the expected help. - [x] 2. The bug has not been fixed in the latest version. -...

Hello, This may be a shot in the dark, but I am wondering if anyone has tried post training InternVL with an RL objective, say GRPO, using vllm engine for...

I have tried visual grounding for InterVL2.5-8B vs. Qwen2.5-VL-7B not using refCOCO but another referring det dataset and I always found Qwen2.5-VL performance is almost 2x better. I am wondering...

Hi, thanks for sharing the InternVL3.5 series! The thinking mode can be activated by setting the system prompt when inferecing with transformers, but how should it be done when running...