DeepSeek-VL issues

TypeError: Object of type AlignerConfig is not JSON serializable

1

当我运行代码 `vl_gpt: MultiModalityCausalLM = AutoModelForCausalLM.from_pretrained(model_path, trust_remote_code=True)` 时会报错 `TypeError: Object of type AlignerConfig is not JSON serializable` 我的模型文件是huggingface上的1.3B-chat版本 when I run code `vl_gpt: MultiModalityCausalLM = AutoModelForCausalLM.from_pretrained(model_path, trust_remote_code=True)` it show error like...

JingyuanHan

dataset format of pretraining stage

How did you unify the format of pretraining dataset? During supervised fine tuning stage, the training data are curated as question and answer pairs. For caption or detection dataset, I...

annopackage

How to get feature representation vector of input image?

Thanks the nice work~! The image embedding has the size of torch.Size([1, 628, 4096]) when input one image. How to get the one feature vector of size 4096 or some...

lukywangwang

多图的数量上限

2

请问DeepSeek-VL最多支持多少张图片输入？

JUNJIE99

How can I get a GenerateOutput after language_model.generate() ?

I hope to get the output.scores, but after generate, it have just returned a Torch.Tensor. How to rewrite the generate function ?

ApolloRay

How to convert it to GGUF/GGML for general use?

Sorry for this dummy question but I did search for some answers and try before. Using llama.cpp ```shell python ./convert-hf-to-gguf.py \ ../../deepseek-vl-7b-chat \ --outtype f16 \ --outfile ../../deepseek-vl-7b-chat/deepseek-v1-7b-chat.gguf ``` returned...

YuanfengZhang

ValueError: The checkpoint you are trying to load has model type `multi_modality` but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.

5

您好！我下载该模型搭配LLamafactory框架，在做api部署的时候，报以下错误 [2024-10-01 00:15:35,483] [INFO] [real_accelerator.py:203:get_accelerator] Setting ds_accelerator to cuda (auto detect) [INFO|configuration_utils.py:670] 2024-10-01 00:15:38,538 >> loading configuration file /mnt/ssd2/models/deepseek-vl-7b-chat/config.json Traceback (most recent call last): File "/home/ubuntu/miniconda3/envs/panc_math_vscode/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py", line 1023, in...

Y-PanC

Feedback on Multimodal Performance%2C Lightweight Models%2C and Transparency

Hello DeepSeek Team%21 %F0%9F%91%8B%0A%0AFirst of all%2C thank you for...

Kitburg1

deepseek-vl可视化无法正常启动

2

I'm a newer, under windows, i strictly have followed the step of deepseek-vl to configure , but finally i got an error as fllows, what is the cause? ![Image](https://github.com/user-attachments/assets/0836296d-32a0-4764-8c68-040050b0e444) ![Image](https://github.com/user-attachments/assets/1a09dbab-06f4-4d47-8f1b-017c768dbeda)

pengshunzheng

The checkpoint you are trying to load has model type `multi_modality` but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.

作者您好，我打算使用deepseek-vl-7b-chat，但是报错：The checkpoint you are trying to load has model type `multi_modality` but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because...

Wuwenji18

DeepSeek-VL
DeepSeek-VL copied to clipboard

Metadata

TypeError: Object of type AlignerConfig is not JSON serializable

dataset format of pretraining stage

How to get feature representation vector of input image?

多图的数量上限

How can I get a GenerateOutput after language_model.generate() ?

How to convert it to GGUF/GGML for general use?

ValueError: The checkpoint you are trying to load has model type `multi_modality` but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.

Feedback on Multimodal Performance%2C Lightweight Models%2C and Transparency

deepseek-vl可视化无法正常启动

The checkpoint you are trying to load has model type `multi_modality` but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.

← Metadata

Owner

Metadata

DeepSeek-VL DeepSeek-VL copied to clipboard

Metadata

← Metadata

Owner

Metadata

DeepSeek-VL
DeepSeek-VL copied to clipboard