DeepSeek-Coder-V2 add fine-tuning code with lora support

Open Muhtasham opened this issue 1 year ago • 5 comments

Closes #35 #41 #58

Aug 25 '24 00:08 Muhtasham

@guoday

Aug 25 '24 00:08 Muhtasham

I need this to fine-tune

Jan 29 '25 11:01 kael53

I remember you telling me about Deepseek in May, props to you @Muhtasham 🏃‍♂️

Jan 30 '25 02:01 BehsadRiemer

@Muhtasham Shouldn't the prompt built in build_instruction_prompt match this example from README.md?

<｜begin▁of▁sentence｜>User: {user_message_1}

Assistant: {assistant_message_1}<｜end▁of▁sentence｜>User: {user_message_2}

Assistant:

Apologies if I misunderstand.

Feb 01 '25 19:02 adamreed90

@adamreed90 Yes, this PR is specifically for instruction fine-tuning, so the prompt format in build_instruction_prompt is intentionally different from the chat-based format.

As I pointed out in the new README.md, please follow the Sample Dataset Format when preparing training data.

If you’re bringing a dataset in a different format (such as chat-based), it would require modification.
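
For anyone who does want the chat-based layout quoted above, here is a minimal sketch of how a list of turns could be rendered into it. It is not part of this PR: build_chat_prompt is a hypothetical helper, the "role"/"content" keys are an assumed dataset schema, and treating the blank line before "Assistant:" as a literal "\n\n" is my reading of the README example.

```python
# Special tokens copied from the README example quoted in this thread.
BOS = "<｜begin▁of▁sentence｜>"
EOS = "<｜end▁of▁sentence｜>"


def build_chat_prompt(messages):
    """Render chat turns into the README's chat layout.

    Hypothetical helper; assumes each message is a dict with
    "role" ("user" or "assistant") and "content" keys.
    """
    parts = [BOS]
    for msg in messages:
        role, content = msg["role"], msg["content"]
        if role == "user":
            # A user turn is followed by a blank line (assumed literal "\n\n").
            parts.append(f"User: {content}\n\n")
        elif role == "assistant":
            # An assistant turn is closed with the end-of-sentence token.
            parts.append(f"Assistant: {content}{EOS}")
        else:
            raise ValueError(f"unsupported role: {role!r}")
    # If the last turn is from the user, leave the prompt open for the model.
    if messages and messages[-1]["role"] == "user":
        parts.append("Assistant:")
    return "".join(parts)


example = build_chat_prompt([
    {"role": "user", "content": "hi"},
    {"role": "assistant", "content": "hello"},
    {"role": "user", "content": "bye"},
])
```

The training script in this PR would still need its own changes to consume prompts built this way, since it expects the Sample Dataset Format.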

Feb 01 '25 22:02 Muhtasham
