pytorch-retinanet
pytorch-retinanet copied to clipboard
Why did you use sigmoid in classification head?
I think sigmoid is not an appropriate choice to use in case of multiple classes in classification branch
In the paper, authors have said that they use sigmoid function to do the classification. They have tried softmax, but sigmoid is better probably.