DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Congratulations to DeepSeek for the wonderful work. I wonder if there is a script for fine-tuning DeepSeek-VL? Thanks!
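The ms-swift large-model training framework now supports inference and fine-tuning for the deepseek-vl series of models. The best-practices guide is here: https://github.com/modelscope/swift/blob/main/docs/source/Multi-Modal/deepseek-vl%E6%9C%80%E4%BD%B3%E5%AE%9E%E8%B7%B5.md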
`python deepseek_vl/serve/app_deepseek.py` is not working in a Colab notebook. It needs a fix!
At the moment, hallucination in Chinese appears to be very severe. This is with the 7B model.
Hello, awesome work on DeepSeek-VL! As you requested, this is a new PR to add the Replicate badge and web demo. (For reference, this was the original PR: https://github.com/deepseek-ai/DeepSeek-VL/pull/5#issuecomment-1990645505) This...
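As the title says. Current open-source multimodal large models all seem to have only limited agent capabilities: they either cannot fully understand text, cannot handle image recognition at the same time, have no task-planning ability, or lack function-calling or interpreter capabilities. Among DeepSeek's current models, the 67B has potential as a text agent, but it is too large. I am looking forward to DeepSeek's efforts in the agent space.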
Detect the platform where the model is loaded and adjust `torch.device` and `torch.dtype` appropriately. I was able to run the model on an M1 Macbook Pro (with poor performance at...
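For readers following this item, here is a minimal sketch of what platform-based selection of `torch.device` and `torch.dtype` could look like. It is not the code from the PR. The helper names `pick_device_and_dtype` and `detect_with_torch` are hypothetical, and the dtype chosen for each backend is an assumption made for illustration.

```python
from typing import Tuple


def pick_device_and_dtype(cuda_available: bool, mps_available: bool) -> Tuple[str, str]:
    """Return (device_name, dtype_name) for the given backend availability.

    The dtype per backend is an illustrative assumption, not the project's
    actual choice: bfloat16 on CUDA, float16 on Apple MPS, float32 on CPU.
    """
    if cuda_available:
        return "cuda", "bfloat16"
    if mps_available:
        return "mps", "float16"
    return "cpu", "float32"


def detect_with_torch():
    """Ask PyTorch which backends are available and build the torch objects."""
    import torch  # imported lazily so the selection logic above has no hard dependency

    # torch.backends.mps is absent in older PyTorch releases, so look it up defensively.
    mps_backend = getattr(torch.backends, "mps", None)
    mps_ok = mps_backend is not None and mps_backend.is_available()

    device_name, dtype_name = pick_device_and_dtype(torch.cuda.is_available(), mps_ok)
    return torch.device(device_name), getattr(torch, dtype_name)
```

With PyTorch installed, `device, dtype = detect_with_torch()` gives values that can be passed to `model.to(device=device, dtype=dtype)`. Whether a given dtype actually works on a given backend depends on the hardware and the PyTorch version, so treat the mapping as a starting point.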
Does the DeepSeek-VL series support multiple images as input? This doesn't seem to be stated in the paper, but `images` in the example script is a `list`, which suggests that it is supported.
Something like this, i.e. img2img.
Hey, thanks a lot for sharing this great accomplishment with the community! I have just tried running the `cli_chat` on Python 3.11 and I get `ImportError: cannot import name 'Mapping' from...