fpn.pytorch
fpn.pytorch copied to clipboard
rcnn_box loss equals to 0 when training
@jwyang Hello, I trained fpn with resnext backbone on my own datasets and I get zero rcnn_box loss during the training, I wonder if there is something wrong ? meanwhile, total loss is fluctuating around 0.2+ below are some output :
[session 1][epoch 1][iter 20200] loss: 0.2516, lr: 1.00e-03
fg/bg=(2/254), time cost: 75.063063
rpn_cls: 0.0987, rpn_box: 0.0070, rcnn_cls: 0.0789, rcnn_box 0.0000
[session 1][epoch 1][iter 20300] loss: 0.2348, lr: 1.00e-03
fg/bg=(4/252), time cost: 71.644270
rpn_cls: 0.1217, rpn_box: 0.0084, rcnn_cls: 0.1442, rcnn_box 0.0000
[session 1][epoch 1][iter 20400] loss: 0.2335, lr: 1.00e-03
fg/bg=(2/254), time cost: 67.127675
rpn_cls: 0.0929, rpn_box: 0.0019, rcnn_cls: 0.0792, rcnn_box 0.0000
[session 1][epoch 1][iter 20500] loss: 0.2261, lr: 1.00e-03
fg/bg=(3/253), time cost: 66.649418
rpn_cls: 0.1408, rpn_box: 0.0142, rcnn_cls: 0.1100, rcnn_box 0.0000
[session 1][epoch 1][iter 20600] loss: 0.2505, lr: 1.00e-03
fg/bg=(4/252), time cost: 67.386812
rpn_cls: 0.1469, rpn_box: 0.0233, rcnn_cls: 0.1441, rcnn_box 0.0000
[session 1][epoch 1][iter 20700] loss: 0.2449, lr: 1.00e-03
fg/bg=(4/252), time cost: 67.287636
rpn_cls: 0.1688, rpn_box: 0.0152, rcnn_cls: 0.1434, rcnn_box 0.0000
[session 1][epoch 1][iter 20800] loss: 0.2117, lr: 1.00e-03
fg/bg=(2/254), time cost: 66.637634
rpn_cls: 0.0515, rpn_box: 0.0037, rcnn_cls: 0.0790, rcnn_box 0.0000
[session 1][epoch 1][iter 20900] loss: 0.2153, lr: 1.00e-03
fg/bg=(2/254), time cost: 68.067170
rpn_cls: 0.0550, rpn_box: 0.0020, rcnn_cls: 0.0788, rcnn_box 0.0000
[session 1][epoch 1][iter 21000] loss: 0.2342, lr: 1.00e-03
fg/bg=(4/252), time cost: 67.029715
rpn_cls: 0.1416, rpn_box: 0.0063, rcnn_cls: 0.1458, rcnn_box 0.0000
[session 1][epoch 1][iter 21100] loss: 0.2384, lr: 1.00e-03
fg/bg=(4/252), time cost: 67.087833
rpn_cls: 0.1267, rpn_box: 0.0053, rcnn_cls: 0.1444, rcnn_box 0.0000
same error How to do it?
@maozezhong ,can you tell me the correspond solution?I have confused about it for a few days.Thanks a lot
I met the same problem.
there are some bugs in this repo, maybe you could try this: https://github.com/DetectionBLWX/FPN.pytorch which is much easier to understand and inherits the property of this repo
@maozezhong hello, I have the same problem,can you help me?