CCNet icon indicating copy to clipboard operation
CCNet copied to clipboard

about the new support for pytorch 1.x

Open mingminzhen opened this issue 5 years ago • 4 comments

The new support for pytorch 1.x is much better to use for multi-gpu. So does it achieve the same performance for training as previous version for pytorch 0.4.1? @speedinghzl

mingminzhen avatar Oct 01 '19 03:10 mingminzhen

@mingminzhen Thanks for asking. They could achieve the same performance when both use OHEM. Without OHEM, the new version could achieve 78.5+ mIOU, which is lower than the previous one. So the new version is still under improving.

speedinghzl avatar Oct 01 '19 14:10 speedinghzl

@speedinghzl So do you use ohem threshold 0.6 or 0.7 to achieve the same performance?

mingminzhen avatar Oct 05 '19 12:10 mingminzhen

@speedinghzl Actually, when I use my network in the new version, I can just get lower performance. I am not sure what happed? It seems you use DistributedDataParallel from apex, inplace_sync_bn from new version. Is possible these two affect final result?

mingminzhen avatar Oct 05 '19 12:10 mingminzhen

@mingminzhen I think the problem is inplace_sync_bn.

dingguo1996 avatar Nov 06 '19 07:11 dingguo1996