CogVLM
a state-of-the-art-level open visual language model | multimodal pretrained model
### System Info My environment is Windows 11 transformers 4.42.3 torch 2.3.0+cu121 deepspeed 0.14.5+unknow ### Who can help? _No response_ ### Information - [X] The...
### System Info CUDA: 12.1, PyTorch: 2.3.1, Python: 3.10, GPU: 4 × A800 (4 × 80 GB), Ubuntu: 22.04, apex is OK ### Who can help? _No response_ ### Information - [X] The official...
### System Info Please scan the WeChat QR code to join the group; if the group has expired, you can add me on WeChat to join: yx116169 ### Who can help? 1 ### Information - [X] The official example scripts - [X] My own...
Hi, how can I check the confidence score or probability score of the generated text? I get an error when I try to access the logits.
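One common way to turn per-step logits into a confidence score is to take the softmax probability of each generated token and then length-normalize. The sketch below is a toy illustration in pure Python, not CogVLM's own API: `token_confidences`, `sequence_confidence`, and the toy logits are hypothetical names and data. In Hugging Face transformers, `generate(..., output_scores=True, return_dict_in_generate=True)` returns per-step scores that could feed such a computation; whether the CogVLM remote-code wrapper supports this unchanged is an assumption to verify.

```python
import math


def token_confidences(step_logits, chosen_ids):
    """Return the softmax probability of the chosen token at each generation step.

    step_logits: list of per-step logit lists (one float per vocabulary entry).
    chosen_ids:  list of the token ids actually generated, one per step.
    """
    probs = []
    for logits, token_id in zip(step_logits, chosen_ids):
        # Numerically stable log-sum-exp for the softmax normalizer.
        peak = max(logits)
        log_norm = peak + math.log(sum(math.exp(x - peak) for x in logits))
        probs.append(math.exp(logits[token_id] - log_norm))
    return probs


def sequence_confidence(probs):
    """Geometric mean of token probabilities (a length-normalized score)."""
    return math.exp(sum(math.log(p) for p in probs) / len(probs))


# Toy data (hypothetical): a 3-token vocabulary and two generation steps.
step_logits = [[2.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
chosen_ids = [0, 1]

probs = token_confidences(step_logits, chosen_ids)
score = sequence_confidence(probs)
print(probs, score)
```

The geometric mean is used here so that longer outputs are not penalized merely for having more tokens; a plain product of probabilities would be the unnormalized alternative.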
Type issue
### System Info SwissArmyTransformer>=0.4.9 transformers>=4.36.2 xformers>=0.0.22 torch>=2.1.0 torchvision>=0.16.2 spacy>=3.6.0 pillow>=10.2.0 deepspeed>=0.13.1 seaborn>=0.13.2 loguru~=0.7.2 streamlit>=1.31.0 timm>=0.9.12 accelerate>=0.26.1 pydantic>=2.6.0 # for openai demo openai>=1.16.0 sse-starlette>=1.8.2 fastapi>=0.110.1 httpx>=0.27.0 uvicorn>=0.29.0 jsonlines>=4.0.0 ###...
Hi, thanks for your work. I am wondering what the difference is between the two CogVLM models in Table 2 and Table 4. The reason I am asking is that the performance...
Hi, thanks for your work. May I ask whether you fine-tuned the VE and MLP Adaptor at both the pre-training stage and the SFT stage? Thanks.
### System Info Same environment as the official requirements ### Who can help? _No response_ ### Information - [X] The official example scripts - [ ] My...
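In the first stage, I train on LAION-2B caption data with the ViT, the MLP projector, and the vision expert unfrozen and the large language model frozen. During training the loss first decreases slowly but later rises, and after the rise the model turns out to have collapsed. I have ruled out problems with the training data, and lowering the learning rate did not help either. What could be the cause?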
How can I run inference with transformers, as is possible with some other VLMs?