Focal-Loss-implement-on-Tensorflow
Focal-Loss-implement-on-Tensorflow copied to clipboard
tf.nn.sigmoid and tf.nn.softmax diff?
The paper of He proposed sigmoid has more robust training effect than softmax, But he didn't explain why. You can have a try. I think maybe it depends the dependent of each probability of result, softmax may have more small probability values(all sum is 1), which may cause more negative samples.