PaddleNLP icon indicating copy to clipboard operation
PaddleNLP copied to clipboard

[Question]: 训练中途报错中断后,重新训练如何在已训练的checkpoint上继续训练而不是重新开始训练

Open Matter-Charles opened this issue 1 year ago • 3 comments

请提出你的问题

训练中途报错中断后,重新跑finetune.py代码发现模型重新训练,从checkpiont-100开始,是否有参数可以选择从之前训练过的某个checkpoint开始继续训练?

Matter-Charles avatar Feb 02 '24 02:02 Matter-Charles

重跑就可以了哈,自动继续训练。

gongel avatar Feb 06 '24 05:02 gongel

This issue is stale because it has been open for 60 days with no activity. 当前issue 60天内无活动,被标记为stale。

github-actions[bot] avatar Apr 27 '24 00:04 github-actions[bot]

可以类似这样,指定自己的checkpoint。

https://github.com/PaddlePaddle/PaddleNLP/blob/ac117a108de2d777fe77d542c732dc5a83889b5d/applications/information_extraction/document/finetune.py#L141

w5688414 avatar May 10 '24 12:05 w5688414

This issue is stale because it has been open for 60 days with no activity. 当前issue 60天内无活动,被标记为stale。

github-actions[bot] avatar Jul 10 '24 00:07 github-actions[bot]

This issue was closed because it has been inactive for 14 days since being marked as stale. 当前issue 被标记为stale已有14天,即将关闭。

github-actions[bot] avatar Jul 24 '24 00:07 github-actions[bot]