FastMaskRCNN
FastMaskRCNN copied to clipboard
How to load resnet-101? (poor result based on resnet-50)
I trained the model with resnet-50 for 200k iterations. But the result is very poor. I wonder if we should use the resnet-101 as the original Mask-RCNN paper?
I think 200k is too small iteration to see any good result. I assume you tried with batch size 1. In the mask rcnn paper, it trained over 160k with effective batch size 16. I think, we should train 16*160k = 2560k at least with batch size 1. Could you share how loss is decreased over time or how accuracy is increased over the 200k iteration?
Hi, It looks like you were able to successfully completed training.
I am trying to start training, but i am getting error
Caused by op u'pyramid_1/AssignGTBoxes/Where_3', defined at:
File "train/train.py", line 339, in
InternalError (see above for traceback): WhereOp: Could not launch cub::DeviceReduce::Sum to count number of true indices. temp_storage_bytes: 1, status: invalid device function [[Node: pyramid_1/AssignGTBoxes/Where_3 = Where_device="/job:localhost/replica:0/task:0/gpu:0"]] [[Node: pyramid_1/fully_connected_3/BiasAdd/_2753 = _Recvclient_terminated=false, recv_device="/job:localhost/replica:0/task:0/cpu:0", send_device="/job:localhost/replica:0/task:0/gpu:0", send_device_incarnation=1, tensor_name="edge_27365_pyramid_1/fully_connected_3/BiasAdd", tensor_type=DT_FLOAT, _device="/job:localhost/replica:0/task:0/cpu:0"]]