FasterRCNNTutorial icon indicating copy to clipboard operation
FasterRCNNTutorial copied to clipboard

ssd training loss

Open elifbykl opened this issue 7 years ago • 2 comments

Hi Jasper,

On the faster r-cnn model, the training loss starts at 4. on the same dataset, why is it starting from 7 on ssd model?

elifbykl avatar Jun 11 '18 07:06 elifbykl

The two architectures use different methods for calculating the loss, so you can't compare the values between them. Faster RCNN uses a smoothed L1 norm loss on the box locations while SSD uses an L2 norm. The total loss for both is a weighted combination of classification loss and position loss, in SSD this has an extra hyperparameter (alpha) which you need to tune.

jaspereb avatar Jun 13 '18 00:06 jaspereb

I think SSD typically has higher loss value than Faster RCNN for the same dataset from my experience. I am not sure this can be ported directly to better performance. Your difference in losses is minimal I have encounters quite big differences especially in the final values.

eypros avatar Jul 03 '18 07:07 eypros