
Add fine-tuning code with LoRA support

Open Muhtasham opened this issue 1 year ago • 5 comments

Closes #35 #41 #58

Muhtasham avatar Aug 25 '24 00:08 Muhtasham

@guoday

Muhtasham avatar Aug 25 '24 00:08 Muhtasham

I need this to fine-tune

kael53 avatar Jan 29 '25 11:01 kael53

I remember you telling me about Deepseek in May, props to you @Muhtasham 🏃‍♂️

BehsadRiemer avatar Jan 30 '25 02:01 BehsadRiemer

@Muhtasham Shouldn't the prompt built in build_instruction_prompt match the example in the README.md?

<|begin▁of▁sentence|>User: {user_message_1}

Assistant: {assistant_message_1}<|end▁of▁sentence|>User: {user_message_2}

Assistant:

Apologies if I misunderstand.
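For reference, the chat-style template quoted above could be assembled roughly as follows. This is only a sketch: build_chat_prompt is a hypothetical helper name (not a function from this PR), and it assumes the blank line between the User and Assistant parts is part of the template rather than a rendering artifact.

```python
# Special tokens copied from the template quoted above.
BOS = "<|begin▁of▁sentence|>"
EOS = "<|end▁of▁sentence|>"


def build_chat_prompt(turns, next_user_message):
    """Assemble a chat-style prompt (hypothetical helper, not part of the PR).

    turns: list of (user_message, assistant_message) pairs already completed.
    next_user_message: the user message the model should answer next.
    """
    prompt = BOS
    for user_message, assistant_message in turns:
        # Each completed turn ends with the end-of-sentence token.
        prompt += f"User: {user_message}\n\nAssistant: {assistant_message}{EOS}"
    # The final turn is left open so the model generates the assistant reply.
    prompt += f"User: {next_user_message}\n\nAssistant:"
    return prompt


if __name__ == "__main__":
    print(build_chat_prompt([("Hi", "Hello!")], "Write a quicksort in Python."))
```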

adamreed90 avatar Feb 01 '25 19:02 adamreed90

@adamreed90 Yes, this PR is specifically for instruction fine-tuning, so the prompt format in build_instruction_prompt is intentionally different from the chat-based format.

As I pointed out in the new README.md, for training data preparation, please follow the Sample Dataset Format.

If you’re bringing a dataset in a different format (such as chat-based), it would require modification.
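As a rough illustration of that kind of modification, a chat-based conversation could be flattened into single-turn records like this. The field names instruction and output, the role/content message layout, and the helper name chat_to_instruction_records are all assumptions made for the sketch; check them against the Sample Dataset Format in the README before using it.

```python
import json


def chat_to_instruction_records(messages):
    """Flatten a chat transcript into single-turn records (hypothetical helper).

    messages: list of dicts with "role" ("user" or "assistant") and "content".
    Returns a list of {"instruction": ..., "output": ...} dicts. The field
    names are assumed, not confirmed by the PR discussion.
    """
    records = []
    pending_user = None
    for message in messages:
        if message["role"] == "user":
            # Remember the latest user message until an assistant reply arrives.
            pending_user = message["content"]
        elif message["role"] == "assistant" and pending_user is not None:
            records.append({"instruction": pending_user, "output": message["content"]})
            pending_user = None
    return records


if __name__ == "__main__":
    chat = [
        {"role": "user", "content": "Reverse a string in Python."},
        {"role": "assistant", "content": "Use s[::-1]."},
    ]
    # One JSON object per line, a common layout for fine-tuning data.
    for record in chat_to_instruction_records(chat):
        print(json.dumps(record, ensure_ascii=False))
```

Note that this pairing drops earlier turns as context, so multi-turn conversations lose information; whether that is acceptable depends on the dataset.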

Muhtasham avatar Feb 01 '25 22:02 Muhtasham