InternVL icon indicating copy to clipboard operation
InternVL copied to clipboard

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Results 461 InternVL issues
Sort by recently updated
recently updated
newest added

请问modeling文件中的 ` image_flags = image_flags.squeeze(-1) input_embeds = self.language_model.get_input_embeddings()(input_ids).clone() vit_embeds = self.extract_feature(pixel_values) vit_embeds = vit_embeds[image_flags == 1] vit_batch_size = pixel_values.shape[0] ` vit_embeds` = vit_embeds[image_flags == 1]的作用是什么呢

### 📚 The doc issue ``` import numpy as np import torch import torchvision.transforms as T from decord import VideoReader, cpu from PIL import Image from torchvision.transforms.functional import InterpolationMode from...

如何接入摄像头的视频流进行抽帧来实时处理问题

### Checklist - [ ] 1. I have searched related issues but cannot get the expected help. - [ ] 2. The bug has not been fixed in the latest...

Hi I am trying to reproduce your results from the InternVL2.5 family not he MMMU dataset. When running InternVL2.5 family with VLMEvalKit framework for sizes larger then 2Bn on the...

I have tried to make InternVL2.5-8B output the boundingbox of the elements in the image, but it doesn't work. My prompt is ``` \nPlease detect and provide all the bounding...

First of all, I would like to express my gratitude to the team for sharing such an impressive model and making it accessible to the community. I am trying to...

### Motivation 请问InternVL2_5-38B可以用vllm部署吗? ### Related resources _No response_ ### Additional context _No response_

### Motivation deepspeed支持pipeline并行,如果想用internvl框架支持pipeline并行,应该如何进行修改呢? ### Related resources _No response_ ### Additional context _No response_

### Checklist - [X] 1. I have searched related issues but cannot get the expected help. - [X] 2. The bug has not been fixed in the latest version. -...