DeepSeek-Coder-V2 icon indicating copy to clipboard operation
DeepSeek-Coder-V2 copied to clipboard

Any plan to release the fintune example?

Open SupercarryNg opened this issue 1 year ago • 3 comments

Great Work and Congraduations! Is there any plan to release a fintune example code for DeepSeek-Coder-V2? I noticed that you mentioned about finetuning this model with 8*A100 GPUs with some skills, could you be more specific? THX!

SupercarryNg avatar Jun 18 '24 02:06 SupercarryNg

We use a self-developed fine-tuning framework and code, so we cannot release it. We are currently trying to use the open-source DeepSpeed for fine-tuning. If there is any progress, we will update the README as soon as possible.

guoday avatar Jun 20 '24 07:06 guoday

We use a self-developed fine-tuning framework and code, so we cannot release it. We are currently trying to use the open-source DeepSpeed for fine-tuning. If there is any progress, we will update the README as soon as possible.

Is there any update on this? Looking forward to your release of the SFT code.

fengyang95 avatar Jul 02 '24 11:07 fengyang95

you can refer to the following links: https://github.com/datawhalechina/self-llm/blob/master/DeepSeek-Coder-V2/04-DeepSeek-Coder-V2-Lite-Instruct%20Lora%20%E5%BE%AE%E8%B0%83.md https://github.com/DYF-AI/custom-swift

guoday avatar Jul 03 '24 05:07 guoday