DeepSeek-VL icon indicating copy to clipboard operation
DeepSeek-VL copied to clipboard

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Results 46 DeepSeek-VL issues
Sort by recently updated
recently updated
newest added

Congratulations to DeepSeek for the wonderful work. I wonder if there is a script for fine-tuning DeepSeek-VL? Thanks!

ms-swift大模型训练框架已经支持了deepseek-vl系列模型的推理和微调~ 最佳实践可以查看这里: https://github.com/modelscope/swift/blob/main/docs/source/Multi-Modal/deepseek-vl%E6%9C%80%E4%BD%B3%E5%AE%9E%E8%B7%B5.md

python deepseek_vl/serve/app_deepseek.py Its not working on colab notebook. Need a fix !!

目前看起来,中文的幻觉十分严重。 7B的模型

Hello, awesome work on DeepSeek-VL As you requested, this is a new PR to add the Replicate badge and web demo. (For reference this was the original PR: https://github.com/deepseek-ai/DeepSeek-VL/pull/5#issuecomment-1990645505) This...

如题。 目前开源的多模态大模型似乎都只有有限的agent能力,要么不能完全理解文字,要么图像识别能力无法兼顾,要么没有任务规划能力,要么缺乏函数调度或者解释器能力。 deepseek目前的模型里,67B有文本agent的潜力,但是太大。 很期待deepseek在agent领域的发力。

Detect the platform where the model is loaded and adjust `torch.device` and `torch.dtype` appropriately. I was able to run the model on an M1 Macbook Pro (with poor performance at...

Does DeepSeek-VL series support input of multiple images? This doesn't seem to be stated in the paper, but `images` in the example script are `list`, which seems to be supported.

![output_chat](https://github.com/deepseek-ai/DeepSeek-VL/assets/142364107/b06dc129-79f0-445a-85ee-f9da3eca9bd9) 类似于这种,img2img

Hey, thanks a lot for sharing this great accomplishment with the community! I have just tried running the `cli_chat` on Python3.11 and I get `ImportError: cannot import name 'Mapping' from...

environment