InternVL icon indicating copy to clipboard operation
InternVL copied to clipboard

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Results 461 InternVL issues
Sort by recently updated
recently updated
newest added

``` 3 prompts = ['帮我提取图中的信息,以json的格式直接返回内容'] * len(image_counts) 4 responses = model.batch_chat(tokenizer, pixel_values, 5 image_counts=image_counts, 6 questions=prompts, 7 generation_config=generation_config) ``` File /usr/local/lib/python3.8/site-packages/torch/nn/modules/module.py:1688, in Module.__getattr__(self, name) 1686 if name in modules: 1687...

GPUS=8 sh evaluate.sh refcoco 测试脚本 InternVL 1.5 测试出的指标如下,请问一下是代码有问题吗 ['model/InternVL-Chat-V1-5', 'refcoco_val', 'Precision @ 1: 0.9019752630607347 \n'] ['model/InternVL-Chat-V1-5', 'refcoco_testA', 'Precision @ 1: 0.9284072830121973 \n'] ['model/InternVL-Chat-V1-5', 'refcoco_testB', 'Precision @ 1: 0.8563297350343474 \n'] ['model/InternVL-Chat-V1-5',...

已按要求完成环境部署,按照文档 How to deploy a local demo? 的说明运行时出现以下异常。 (internvl) yushen@user-MS-7E06:~/ai/InternVL/internvl_chat_llava$ python -m llava.serve.gradio_web_server --controller http://0.0.0.0:10000 --model-list-mode reload --device auto 2024-05-24 15:14:59 | ERROR | stderr | Traceback (most recent call...

Hi all, Thank you for your wonderful work! I am trying to run the model of InternVL−Chat−V1.5-Int8 using the [huggingface link](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-5-Int8). I was able to get one inference result but...

在官方给出的demo中,使用了example文件夹下的image1以及image2,可以顺利进行multi-image conversation 但如果简单将image1、2换成image4、5,模型就只能识别到其中的一张图片,查看分辨率发现image4的分辨率达到了1920x1200,而image2的1000x1000分辨率就不会出现问题。 想请问下进行multi-image conversation的话,单张图像的分辨率上限是多少呢?同时模型最多可以支持多少张图像的输入呢? @czczup 谢谢!^_^

internvl_chat/shell/internlm2_20b_dynamic/internvl_chat_v1_5_internlm2_20b_dynamic_res_finetune.sh how many gpus are need for finetuning? I noticed that for 1-2 version: Note: fine-tune the full LLM needs 16 A100 80G GPUs, and fine-tune the LoRA needs 2...

### Motivation 我这边报内存不够,应该是显存不够的意思吧。目前测试的是推理。 ### Related resources ![显存不够](https://github.com/user-attachments/assets/668a981f-a2af-4d3f-9c94-1e5138fa15b4) ### Additional context _No response_

### Motivation 希望能给个数据。大概说一下就行。 ### Related resources _No response_ ### Additional context _No response_

### Checklist - [X] 1. I have searched related issues but cannot get the expected help. - [X] 2. The bug has not been fixed in the latest version. -...

### Motivation 我看本地demo是可以视频推理的,但是api只能接收图片,这个后边可以支持吗? ### Related resources _No response_ ### Additional context _No response_