open-solution-mapping-challenge icon indicating copy to clipboard operation
open-solution-mapping-challenge copied to clipboard

when training finishes it hits the memory error

Open jakubczakon opened this issue 5 years ago • 2 comments

Running train crashes when the pipelines are collating results from running transform on the entire dataset.

jakubczakon avatar Jul 18 '19 10:07 jakubczakon

did you find any solution

animeshsahu80 avatar Jul 23 '19 22:07 animeshsahu80

Well, the training is working and the checkpoints are saved after every epoch (if improved) so the trained model is available.

All that needs to be done is:

cp experiment/checkpoints/unet/best.torch experiment/transformers/unet

and then run the evaluation/prediction.

So it is more of an inconvenience than a big problem.

jakubczakon avatar Jul 24 '19 07:07 jakubczakon