InternVL icon indicating copy to clipboard operation
InternVL copied to clipboard

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Results 461 InternVL issues
Sort by recently updated
recently updated
newest added

如题。。如果pretrain就把图片切那么多份,训练成本是不是有些cover不住

I found that there is a image transformation step on load_image function from the example on huggingface repo (transformers based), but there is not any image processing on the gradio_web_server...

Why does the file internvl_chat_v1_2_hermes2_yi34b_448_finetune.sh include --freeze_backbone False? Isn't the visual encoder supposed to be frozen during the pre-training phase?

Help with this. Thank you ImportError: libtorch_cuda_cu.so: cannot open shared object file: No such file or directory Python 3.8.13 (default, Oct 21 2022, 23:50:54) [GCC 11.2.0] :: Anaconda, Inc. on...

Why do 3* 4090GPUs still out of memory (24*3>52GB) 0 NVIDIA GeForce RTX 4090 Off | 00000000:31:00.0 Off | Off | | 66% 24C P8 22W / 450W | 42MiB...

Thanks for your great job! I follow your tutorial in [https://huggingface.co/OpenGVLab/InternVL-Chat-V1-5-Int8](url) and I found that the model only support single image conversation. I use the Int8 model. for example, i...

Thank you for releasing this wonderful work and keep updating the latest scripts for training and fine-tuning! Recently I have tried to fine-tune the InternVL-V1.5 using custom dataset, and I...

如果最后一行为两个字或者很少的字,会被整理成分类名称,导致返回的json内容中地址信息缺少部分信息 这是提示词的问题,还是模型的问题,不知道如何优化,给指教指教 ![image](https://github.com/OpenGVLab/InternVL/assets/74588507/e697b598-bd04-4931-8499-5f289408a9ca)