LLaMA-Factory
LLaMA-Factory copied to clipboard
baichuan微调后的web demo回复会出现"Human: "
用train_sft的do_predict预测了200条,没有"Human: ",但baichuan微调后的web demo回复会出现"Human: "
repeatPenalty 调高