Pytorch-UNet icon indicating copy to clipboard operation
Pytorch-UNet copied to clipboard

Training stuck at the end of the first epoch

Open XxUpUp opened this issue 1 year ago • 4 comments

Hi, I'm having the problem that when the last batch of my first epoch is trained, the program can't start the next epoch but stays stuck there and no error message appears, what could be the reason for this? (Note: I have made some changes to train.py, but not the main body of the code) image

XxUpUp avatar Jul 25 '24 16:07 XxUpUp

什么原因呢?

zaoyueri avatar Sep 07 '24 11:09 zaoyueri

什么原因呢?

I'm still not quite sure what the cause of this problem is.

XxUpUp avatar Sep 12 '24 09:09 XxUpUp

请问你解决了吗,我也遇到了一模一样的问题

chenshans avatar Sep 20 '24 08:09 chenshans

请问你解决了吗,我也遇到了一模一样的问题

Not yet.

XxUpUp avatar Sep 20 '24 10:09 XxUpUp

You can try use PyTorch 1.13 or later, I solve it by change pytorch verson

ZafirTan avatar Nov 23 '24 05:11 ZafirTan

请问解决了吗 我也遇到了一样的问题

cvliu-hub avatar Mar 03 '25 01:03 cvliu-hub

请问解决了吗 我也遇到了一样的

XxUpUp avatar Mar 03 '25 10:03 XxUpUp

请问解决了吗 我也遇到了一样的问题

Perhaps you could try adjusting the learning rate, for example modifying it from 0.0001 to 0.001

XxUpUp avatar Mar 03 '25 10:03 XxUpUp