DeepSeek-VL icon indicating copy to clipboard operation
DeepSeek-VL copied to clipboard

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Results 46 DeepSeek-VL issues
Sort by recently updated
recently updated
newest added
trafficstars

当我运行代码 `vl_gpt: MultiModalityCausalLM = AutoModelForCausalLM.from_pretrained(model_path, trust_remote_code=True)` 时会报错 `TypeError: Object of type AlignerConfig is not JSON serializable` 我的模型文件是huggingface上的1.3B-chat版本 when I run code `vl_gpt: MultiModalityCausalLM = AutoModelForCausalLM.from_pretrained(model_path, trust_remote_code=True)` it show error like...

How did you unify the format of pretraining dataset? During supervised fine tuning stage, the training data are curated as question and answer pairs. For caption or detection dataset, I...

Thanks the nice work~! The image embedding has the size of torch.Size([1, 628, 4096]) when input one image. How to get the one feature vector of size 4096 or some...

请问DeepSeek-VL最多支持多少张图片输入?

I hope to get the output.scores, but after generate, it have just returned a Torch.Tensor. How to rewrite the generate function ?

Sorry for this dummy question but I did search for some answers and try before. Using llama.cpp ```shell python ./convert-hf-to-gguf.py \ ../../deepseek-vl-7b-chat \ --outtype f16 \ --outfile ../../deepseek-vl-7b-chat/deepseek-v1-7b-chat.gguf ``` returned...

您好! 我下载该模型搭配LLamafactory框架,在做api部署的时候,报以下错误 [2024-10-01 00:15:35,483] [INFO] [real_accelerator.py:203:get_accelerator] Setting ds_accelerator to cuda (auto detect) [INFO|configuration_utils.py:670] 2024-10-01 00:15:38,538 >> loading configuration file /mnt/ssd2/models/deepseek-vl-7b-chat/config.json Traceback (most recent call last): File "/home/ubuntu/miniconda3/envs/panc_math_vscode/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py", line 1023, in...

Hello DeepSeek Team%21 %F0%9F%91%8B%0A%0AFirst of all%2C thank you for...

I'm a newer, under windows, i strictly have followed the step of deepseek-vl to configure , but finally i got an error as fllows, what is the cause? ![Image](https://github.com/user-attachments/assets/0836296d-32a0-4764-8c68-040050b0e444) ![Image](https://github.com/user-attachments/assets/1a09dbab-06f4-4d47-8f1b-017c768dbeda)

作者您好,我打算使用deepseek-vl-7b-chat,但是报错:The checkpoint you are trying to load has model type `multi_modality` but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because...