InternVL
InternVL copied to clipboard
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
如题。。如果pretrain就把图片切那么多份,训练成本是不是有些cover不住
I found that there is a image transformation step on load_image function from the example on huggingface repo (transformers based), but there is not any image processing on the gradio_web_server...
Why does the file internvl_chat_v1_2_hermes2_yi34b_448_finetune.sh include --freeze_backbone False? Isn't the visual encoder supposed to be frozen during the pre-training phase?
Help with this. Thank you ImportError: libtorch_cuda_cu.so: cannot open shared object file: No such file or directory Python 3.8.13 (default, Oct 21 2022, 23:50:54) [GCC 11.2.0] :: Anaconda, Inc. on...
Why do 3* 4090GPUs still out of memory (24*3>52GB) 0 NVIDIA GeForce RTX 4090 Off | 00000000:31:00.0 Off | Off | | 66% 24C P8 22W / 450W | 42MiB...
Thanks for your great job! I follow your tutorial in [https://huggingface.co/OpenGVLab/InternVL-Chat-V1-5-Int8](url) and I found that the model only support single image conversation. I use the Int8 model. for example, i...
Thank you for releasing this wonderful work and keep updating the latest scripts for training and fine-tuning! Recently I have tried to fine-tune the InternVL-V1.5 using custom dataset, and I...
如果最后一行为两个字或者很少的字,会被整理成分类名称,导致返回的json内容中地址信息缺少部分信息 这是提示词的问题,还是模型的问题,不知道如何优化,给指教指教 