SQLNet icon indicating copy to clipboard operation
SQLNet copied to clipboard

Problem with resuming the training

Open sravand93 opened this issue 6 years ago • 0 comments

I am using google colab with GPU to train the model. After 27 epochs connection got disconnected. And when i try to run the model again it is starting from epoch1. So how can I resume my training from epoch 27.

I am new to pytorch and when I have gone through the pytorch tutorial of saving and loading models and found that we have to save check points to resume the training. So should i restart my training by saving the checkpoints? If I can resume my training using the models saved earlier,Please help me how to do it.

Thank You.

sravand93 avatar Oct 11 '19 07:10 sravand93