InternVL
InternVL copied to clipboard
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
请问modeling文件中的 ` image_flags = image_flags.squeeze(-1) input_embeds = self.language_model.get_input_embeddings()(input_ids).clone() vit_embeds = self.extract_feature(pixel_values) vit_embeds = vit_embeds[image_flags == 1] vit_batch_size = pixel_values.shape[0] ` vit_embeds` = vit_embeds[image_flags == 1]的作用是什么呢
### 📚 The doc issue ``` import numpy as np import torch import torchvision.transforms as T from decord import VideoReader, cpu from PIL import Image from torchvision.transforms.functional import InterpolationMode from...
如何接入摄像头的视频流进行抽帧来实时处理问题
### Checklist - [ ] 1. I have searched related issues but cannot get the expected help. - [ ] 2. The bug has not been fixed in the latest...
Hi I am trying to reproduce your results from the InternVL2.5 family not he MMMU dataset. When running InternVL2.5 family with VLMEvalKit framework for sizes larger then 2Bn on the...
I have tried to make InternVL2.5-8B output the boundingbox of the elements in the image, but it doesn't work. My prompt is ``` \nPlease detect and provide all the bounding...
First of all, I would like to express my gratitude to the team for sharing such an impressive model and making it accessible to the community. I am trying to...
### Motivation 请问InternVL2_5-38B可以用vllm部署吗? ### Related resources _No response_ ### Additional context _No response_
### Motivation deepspeed支持pipeline并行,如果想用internvl框架支持pipeline并行,应该如何进行修改呢? ### Related resources _No response_ ### Additional context _No response_
### Checklist - [X] 1. I have searched related issues but cannot get the expected help. - [X] 2. The bug has not been fixed in the latest version. -...