牛宇霖

Results 5 comments of 牛宇霖

@hiyouga 好的,谢谢您的回复。如果我先merge lora后再训练,那我的model_name_or_path和adapter_name_or_path这两个参数,都是指向同一个合并后的模型吗? 还是说我脚本中直接删除adapter_name_or_path只保留model_name_or_path? 非常感谢!

@hiyouga 谢谢您!我按您说的方式试了一下,DPO训练后模型输出正常了。非常感谢!

I encountered the same error on kaggle's TPU VM v3-8 when using lit-gpt project's example finetuning code today. is there any progress on this issue?

Sure. I sent you them by email. Please check it. Thank you so much!