ProPainter icon indicating copy to clipboard operation
ProPainter copied to clipboard

请问运行train.py时卡住了怎么处理?

Open oxxj opened this issue 1 year ago • 5 comments

我在运行python train.py -c configs/train_propainter.json时,显示下面这种情况

(propainter) ➜ ProPainter git:(main) ✗ python train.py -c configs/train_propainter.json world_size: 4 using GPU 0-0 for training [**] create folder experiments_model/propainter_train_propainter using GPU 1-1 for training using GPU 2-2 for training using GPU 3-3 for training Pretrained flow completion model has loaded... Pretrained flow completion model has loaded... Network [InpaintGenerator] was created. Total number of parameters: 39.4 million. To see the architecture, do print(network). Pretrained flow completion model has loaded... Network [InpaintGenerator] was created. Total number of parameters: 39.4 million. To see the architecture, do print(network). Pretrained flow completion model has loaded... Network [InpaintGenerator] was created. Total number of parameters: 39.4 million. To see the architecture, do print(network). Network [InpaintGenerator] was created. Total number of parameters: 39.4 million. To see the architecture, do print(network). Warnning: There is no trained model found.An initialized model will be used. 0%| | 0/700000 [00:00<?, ?it/s]

另外运行python train.py -c configs/train_flowcomp.json时也一直停在0%这儿,请问这个怎么处理?

oxxj avatar Sep 02 '24 10:09 oxxj

我也是这种情况, 请问你解决了吗

nxrcqupt01 avatar Oct 11 '24 12:10 nxrcqupt01

我也是这种情况,请问解决了吗

我修改了model/propainter.py的183行,将x.view修改为x.reshape

oxxj avatar Oct 12 '24 07:10 oxxj

请问你的训练时长是多少?我在4张A100跑,需要1000个小时,这正常吗?

shenyewei avatar Oct 30 '24 07:10 shenyewei

请问你的训练时长是多少?我在4张A100跑,需要1000个小时,这正常吗?

我当时也跑了很久,用的4张A6000,如果你只是想快速跑完可以修改下train_flowcomp.json/train_propainter.json里的"iterations"参数看看

oxxj avatar Oct 31 '24 08:10 oxxj

我也是这种情况,请问解决了吗

我修改了model/propainter.py的183行,将x.view修改为x.reshape

请问改完了还不好用怎么办

Zhihui-Zheng avatar Mar 21 '25 09:03 Zhihui-Zheng