InternVL issues

Does vLLM currently support deploying InternVL3_5-241B-A28B?

2

Version: vllm==0.10.0. Launch command: CUDA_VISIBLE_DEVICES=4,5,6,7 vllm serve /data/models/InternVL3_5-241B-A28B \ --tensor-parallel-size 4 \ --trust-remote-code Error output: INFO 09-10 10:02:12 [__init__.py:235] Automatically detected platform cuda. INFO 09-10 10:02:14 [api_server.py:1755] vLLM API server version 0.10.0...

vincentlbj

Pre-Training & SFT datasets

1

Thank you for your excellent work on InternVL3.5! Will the datasets you used during the Pre-Training and SFT phases be made public? In the technical report, you mentioned that some additional data was...

Maple-geekZhu

[Bug] Garbled text when running InternVL-3 / 3.5 with custom multi-GPU split_model on RTX 5880 (works on H20)

### Checklist - [ ] 1. I have searched related issues but cannot get the expected help. - [ ] 2. The bug has not been fixed in the latest...

weirdo2310

pretrain data mixture

How is the mixture.json in ./path/to/pretrain/data/mixture.json generated? If I need to generate this JSON from my own pretrain data, could you provide an example?

hhxxttxsh

[Bug] InternVL 3.5 38B OOM

2

### Checklist - [ ] 1. I have searched related issues but cannot get the expected help. - [ ] 2. The bug has not been fixed in the latest...

chiyic

Question about how the paper maps to the code

1

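I am reading the paper "InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks". The paper describes three stages, each of which has to be trained in turn to assemble the final model. Which part of the code does each training stage correspond to?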

scuizhibin

[Bug] Reproducing MMVet, AI2D Results

2

### Checklist - [x] 1. I have searched related issues but cannot get the expected help. - [x] 2. The bug has not been fixed in the latest version. -...

avdravid

post-training InternVL with RL objective using vllm engine and FSDP results in RuntimeError: The tensor has a non-zero number of elements, but its data is not allocated yet.

6

Hello, This may be a shot in the dark, but I am wondering if anyone has tried post training InternVL with an RL objective, say GRPO, using vllm engine for...

SStoica12

Visual Grounding Results

4

I have compared visual grounding for InternVL2.5-8B vs. Qwen2.5-VL-7B, using another referring detection dataset rather than RefCOCO, and I consistently found that Qwen2.5-VL performs almost 2x better. I am wondering...

MSiam

issue on activating thinking mode with lmdeploy

2

Hi, thanks for sharing the InternVL3.5 series! The thinking mode can be activated by setting the system prompt when running inference with transformers, but how should it be done when running...

yijunCai
