InternVL issues

AttributeError: 'InternVLChatModel' object has no attribute 'batch_chat'

``` 3 prompts = ['帮我提取图中的信息，以json的格式直接返回内容'] * len(image_counts) 4 responses = model.batch_chat(tokenizer, pixel_values, 5 image_counts=image_counts, 6 questions=prompts, 7 generation_config=generation_config) ``` File /usr/local/lib/python3.8/site-packages/torch/nn/modules/module.py:1688, in Module.__getattr__(self, name) 1686 if name in modules: 1687...

longgb246

Refer Expression Comprehension RefCOCO 验证结果不一致

2

GPUS=8 sh evaluate.sh refcoco 测试脚本 InternVL 1.5 测试出的指标如下，请问一下是代码有问题吗 ['model/InternVL-Chat-V1-5', 'refcoco_val', 'Precision @ 1: 0.9019752630607347 \n'] ['model/InternVL-Chat-V1-5', 'refcoco_testA', 'Precision @ 1: 0.9284072830121973 \n'] ['model/InternVL-Chat-V1-5', 'refcoco_testB', 'Precision @ 1: 0.8563297350343474 \n'] ['model/InternVL-Chat-V1-5',...

yanzaaaasa

运行本地demo时发生错误，恳请帮助，谢谢。

2

已按要求完成环境部署，按照文档 How to deploy a local demo? 的说明运行时出现以下异常。 (internvl) yushen@user-MS-7E06:~/ai/InternVL/internvl_chat_llava$ python -m llava.serve.gradio_web_server --controller http://0.0.0.0:10000 --model-list-mode reload --device auto 2024-05-24 15:14:59 | ERROR | stderr | Traceback (most recent call...

ysyx2008

an illegal memory access was encountered when running InternVL−Chat−V1.5-Int8 model

1

Hi all, Thank you for your wonderful work! I am trying to run the model of InternVL−Chat−V1.5-Int8 using the [huggingface link](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-5-Int8). I was able to get one inference result but...

tairen99

多图推理的最大分辨率问题

在官方给出的demo中，使用了example文件夹下的image1以及image2，可以顺利进行multi-image conversation 但如果简单将image1、2换成image4、5，模型就只能识别到其中的一张图片，查看分辨率发现image4的分辨率达到了1920x1200,而image2的1000x1000分辨率就不会出现问题。想请问下进行multi-image conversation的话，单张图像的分辨率上限是多少呢？同时模型最多可以支持多少张图像的输入呢？ @czczup 谢谢！^_^

zhangye0402

how many gpus and how long time does it cost for V1-5 model finetuning?

internvl_chat/shell/internlm2_20b_dynamic/internvl_chat_v1_5_internlm2_20b_dynamic_res_finetune.sh how many gpus are need for finetuning? I noticed that for 1-2 version: Note: fine-tune the full LLM needs 16 A100 80G GPUs, and fine-tune the LoRA needs 2...

cyj95

[Feature] 请问 InternVL2-Llama3-76B的训练和推理大概需要多少显存？

5

### Motivation 我这边报内存不够，应该是显存不够的意思吧。目前测试的是推理。 ### Related resources ![显存不够](https://github.com/user-attachments/assets/668a981f-a2af-4d3f-9c94-1e5138fa15b4) ### Additional context _No response_

lckj2009

sunnymoon155

InternVL
InternVL copied to clipboard

Metadata

AttributeError: 'InternVLChatModel' object has no attribute 'batch_chat'

Refer Expression Comprehension RefCOCO 验证结果不一致

运行本地demo时发生错误，恳请帮助，谢谢。

an illegal memory access was encountered when running InternVL−Chat−V1.5-Int8 model

多图推理的最大分辨率问题

how many gpus and how long time does it cost for V1-5 model finetuning?

[Feature] 请问 InternVL2-Llama3-76B的训练和推理大概需要多少显存？

[Feature] 请问， InternVL2-Llama3-76B的推理和训练分别需要多大的显存？我这边每次都报显存不够

[Bug] InternVLChatModel.batch_chat()中缺少设置template.system_message的操作

请问vl2.0的api支持视频输入吗？

← Metadata

Owner

Metadata

InternVL InternVL copied to clipboard

Metadata

← Metadata

Owner

Metadata

InternVL
InternVL copied to clipboard