CAE icon indicating copy to clipboard operation
CAE copied to clipboard

torch.distributed.elastic.multiprocessiong.erroes.ChildFailedError:

Open linglingl635 opened this issue 2 years ago • 1 comments

why my terminal tell me this problem after training epoch 0? how can I fix it? 47O$WG{BIG$~(TH4LZECK_I

linglingl635 avatar Oct 13 '22 03:10 linglingl635

Hi, we haven't met this problem before and I guess it has nothing to do with the code. Are the environment installed exactly the same as the readme file?

SelfSup-MIM avatar Oct 16 '22 02:10 SelfSup-MIM