LLaMA-Factory

Garbled inference output after fine-tuning DeepSeek

Open HelloWorld506 opened this issue 10 months ago • 2 comments

Reminder

  • [x] I have read the above rules and searched the existing issues.

System Info

Latest version of LLaMA-Factory

Reproduction

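I fine-tuned the deepseek-qwen-7B model. My target outputs are only A, B, or C, and accuracy during training was very high. At inference time, however, the model outputs a chain of thought, and sometimes even emits tokens such as <|im_start|>user that belong in the input. Does training do something that stops the model from outputting the chain of thought? Also, why does inference produce tokens that belong in the input, and how can I fix this?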

Others

No response

HelloWorld506, Feb 12 '25 03:02