DeepSeek-Coder-V2 issues

issue bug

![image](https://github.com/user-attachments/assets/46ae51f6-51d4-4704-b5a5-d26fdec975cf) continue 插件访问coder-v2模型

Hi, I have a question about fine tuning.

1. when inferring from a model to FIM, I want to fine-tune it with a dataset, but the format of the dataset is {"instruction" : "", "output": ""} should I...

Yindy07

What is the FSDP value for `fsdp_transformer_layer_cls_to_wrap`?

Hey there, Trying to fine-tune your model. What is the FSDP value for `fsdp_transformer_layer_cls_to_wrap`? Thanks!

Metaspectral

OutOfMemoryError: CUDA out of memory on RunPod

2

**Description:** While running DeepSeek Coder v2 on RunPod, I encountered a `CUDA out of memory` error. The error message indicated that the system attempted to allocate 20.00 MiB of memory,...

loyal812

请问怎么把deepseek和知识库结合生成私有化的AI Agent

请问怎么把deepseek和知识库结合生成私有化的AI Agent，之前是用字节的coze做的，但是coze的模型效果不太好，最近试了一下deepseek，感觉不错，请问是否有比较靠谱的落地方案

zhanghanting

When will the vllm PR be merged to the main branch?

8

Thank you for your impressive work on this project. I'm eager to try this model, but I've noticed that the `vllm` deployment [pull request](https://github.com/vllm-project/vllm/pull/4650) has conflicts with the main branch,...

zuxin666