Why only output the max cls score during inference?
I notice that YOLOX only output the max cls score for each anchor point during inference, why ?
One anchor matches one box in a single class.
One anchor matches one box in a single class.
I haven't seen the use of max-score in other anchor-free methods. Why is it better than dividing the predicted bbox of each anchor into n(num of classes) bboxes with different scores (just like in other anchor-free methods)?
One box multi-classes is widely used, e.g. RetinaNet, FCOS(acnhor free). In my opinion, the method you are talking is used in two stage detector like Faster R-CNN.
I mean it is one box, single-class in YOLOX. But one box mulit-classes in RetinaNet, FCOS. When inference, YOLOX will transform each bbox with multi-classes score to bbox with max score using torch.max before nms, the RetinaNet and FCOS didn't do this...

@FateScript what are scores = outputs[:, 4] * outputs[:, 5] what are pos 4 & 5 for in results, why do we multiply above
@FateScript what are scores = outputs[:, 4] * outputs[:, 5] what are pos 4 & 5 for in results, why do we multiply above
Code here comes from a model named FCOS and it is very common in detection algorithm.
@FateScript what are scores = outputs[:, 4] * outputs[:, 5] what are pos 4 & 5 for in results, why do we multiply above
Code here comes from a model named FCOS and it is very common in detection algorithm. @FateScript Thanks what I want to know what pos 4 and 5 contain