PreSumm

How to resume training from a checkpoint?

Open ithieund opened this issue 4 years ago • 6 comments

Hi @nlpyang, can you give me some suggestions on how to resume training from a previous checkpoint, so that I don't have to train the model from the beginning in a new session?

Thank you very much.

ithieund • Feb 22 '21 02:02

Use the -train_from <model_path> argument to continue where you left off.

henrypham67 • Jun 29 '21 06:06

After I set this parameter, it has no effect. I set -train_from ../models/model_step_4000.pt. Is that right? Thank you.

SabrinaZhuangxx • Jul 12 '22 04:07

Could you provide the full log?

henrypham67 • Jul 13 '22 02:07

Same here. It just loads the data file and then exits.

vedantgannu • Dec 17 '23 08:12

[image]

vedantgannu • Dec 17 '23 08:12

Nvm, figured out the issue. Basically, you need to make sure your -train_steps value is higher than the checkpoint's train step.

vedantgannu • Dec 22 '23 14:12
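
For anyone landing here later, the two points from this thread (pass -train_from, and keep -train_steps above the checkpoint's step) can be combined into a small pre-flight check. This is only a sketch, not part of PreSumm: checkpoint_step and resume_args are hypothetical helpers. It assumes the checkpoint file keeps the model_step_<N>.pt naming seen above and that the number in the file name matches the step stored in the checkpoint. The value 50000 is an arbitrary example.

```python
import re
from pathlib import Path
from typing import List


def checkpoint_step(path: str) -> int:
    """Read the step count from a file name such as model_step_4000.pt.

    Assumption: the number in the file name is the step the checkpoint
    was saved at.
    """
    match = re.fullmatch(r"model_step_(\d+)\.pt", Path(path).name)
    if match is None:
        raise ValueError(f"cannot read a step count from {path!r}")
    return int(match.group(1))


def resume_args(checkpoint: str, train_steps: int) -> List[str]:
    """Build the two resume-related flags discussed in this thread.

    Raises ValueError when train_steps is not higher than the checkpoint's
    step, which is the situation reported above where the run loads the
    data and then exits.
    """
    step = checkpoint_step(checkpoint)
    if train_steps <= step:
        raise ValueError(
            f"-train_steps ({train_steps}) must be higher than the "
            f"checkpoint step ({step})"
        )
    return ["-train_from", checkpoint, "-train_steps", str(train_steps)]


if __name__ == "__main__":
    # Example with the checkpoint path mentioned earlier in the thread.
    print(" ".join(resume_args("../models/model_step_4000.pt", 50000)))
```

The returned flags would be appended to whatever training command you already use; all other arguments stay the same as in the original run.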