
Question about running train.py

Open • YuyingShang opened this issue 4 years ago • 1 comment

When I run the code, with either the lsr-bert model or the lsr model, my process is killed after evaluate_epoch=30. For both models I have set batch_size=3 and lr=1e-3. Do you have any idea what causes this? Looking forward to your reply.

image

image

YuyingShang, Mar 15 '21 07:03

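"exit code 137" means the process was killed with SIGKILL (137 = 128 + 9). On Linux that usually means the out-of-memory killer stopped it, so you don't have enough memory to train.

I would decrease the batch size from 3 to 1 or 2.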
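For anyone reading an exit status like the 137 above, here is a minimal sketch of how a shell-style status maps to a signal. The helper name `describe_exit_code` is made up for illustration and is not part of this repo. It assumes a POSIX system, where a status above 128 means "killed by signal (status - 128)".

```python
import signal


def describe_exit_code(code: int) -> str:
    """Interpret a shell-style exit status (hypothetical helper, POSIX assumed)."""
    if code > 128:
        # Shells report "killed by signal N" as 128 + N.
        signum = code - 128
        try:
            name = signal.Signals(signum).name
        except ValueError:
            # Signal number not known on this platform.
            name = f"signal {signum}"
        return f"killed by {name}"
    if code == 0:
        return "exited normally"
    return f"exited with error status {code}"


print(describe_exit_code(137))  # on Linux/macOS: killed by SIGKILL
```

SIGKILL on its own does not prove an out-of-memory kill, since anything can send it. On Linux the kernel log (for example via `dmesg`) normally records it when the OOM killer is responsible.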

logan-markewich, Apr 20 '21 17:04