wangmingrun1314

Results 2 issues of wangmingrun1314

I use this model (efficient b4) as a backbone. However, it occupies a large amount of memory during training. Has anyone else run into this problem? But, in fact, its size

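1. Why does this network's loss only drop noticeably every other epoch? 2. Why does the loss stop decreasing when I set num_workers=4, 8, or some other value? 3. When I use DistributedDataParallel, I always get an error telling me to set `find_unused_parameters=True` on torch.nn.parallel.DistributedDataParallel. These are all such strange errors. Thanks!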
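For the third question, the flag named in the error message is a constructor argument of `torch.nn.parallel.DistributedDataParallel`. The error usually appears when some registered parameters do not contribute to the loss in a given forward pass. The following is only a minimal sketch of where the flag goes, not a diagnosis of the model in this issue: `TwoHeadNet`, its layer names, and `run_single_process_demo` are hypothetical, and the single-process CPU `gloo` group is an assumption made so the example can run without GPUs.

```python
import os
import tempfile

import torch
import torch.distributed as dist
import torch.nn as nn
from torch.nn.parallel import DistributedDataParallel as DDP


class TwoHeadNet(nn.Module):
    """Hypothetical model: `unused_head` never contributes to the loss."""

    def __init__(self):
        super().__init__()
        self.backbone = nn.Linear(8, 4)
        self.used_head = nn.Linear(4, 1)
        self.unused_head = nn.Linear(4, 1)  # registered but skipped in forward()

    def forward(self, x):
        return self.used_head(self.backbone(x))


def run_single_process_demo():
    # Assumption: a one-process "gloo" group on CPU, initialised through a
    # temporary file, purely so the sketch is self-contained.
    init_file = os.path.join(tempfile.mkdtemp(), "ddp_init")
    dist.init_process_group(
        "gloo", init_method=f"file://{init_file}", rank=0, world_size=1
    )
    try:
        # The flag asks DDP to detect parameters that received no gradient
        # in this iteration instead of waiting for them.
        model = DDP(TwoHeadNet(), find_unused_parameters=True)
        optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
        for _ in range(2):
            optimizer.zero_grad()
            loss = model(torch.randn(16, 8)).pow(2).mean()
            loss.backward()
            optimizer.step()
        return loss.item()
    finally:
        dist.destroy_process_group()


if __name__ == "__main__":
    print(run_single_process_demo())
```

Enabling the flag adds an extra traversal of the autograd graph on every iteration, so when the unused parameters are known it may be preferable to remove them from the model or to route them into the loss instead.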