DUL
About the softmax loss used for the classification (cls) loss
It seems that you did not use a softmax loss for classification, as the paper describes.
You only used a fully connected layer, so it amounts to an ordinary cross-entropy (CE) loss.
Is there something I have missed, or is this a small mistake?
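
To make the question concrete, here is a minimal sketch of the comparison I have in mind. It is not taken from the repository: the helper names (`fully_connected`, `softmax_loss`, `cross_entropy_from_logits`) and the toy values are hypothetical, and it uses plain Python rather than the project's framework. It computes logits with a fully connected layer, then evaluates the loss once as an explicit softmax followed by negative log-likelihood and once as a cross-entropy taken directly from the logits, so the two can be compared.

```python
import math

def fully_connected(x, weights, bias):
    """Compute logits = W x + b for a single sample (plain lists, no framework)."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def softmax_loss(logits, target):
    """Explicit softmax followed by the negative log-likelihood of the target."""
    return -math.log(softmax(logits)[target])

def cross_entropy_from_logits(logits, target):
    """Cross-entropy computed directly from logits via log-sum-exp."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[target]

# Hypothetical toy values, chosen only for illustration.
x = [0.5, -1.0, 2.0]
weights = [[0.1, 0.2, 0.3], [-0.4, 0.5, 0.6], [0.7, -0.8, 0.9]]
bias = [0.0, 0.1, -0.1]
target = 2

logits = fully_connected(x, weights, bias)
loss_a = softmax_loss(logits, target)
loss_b = cross_entropy_from_logits(logits, target)
print(abs(loss_a - loss_b) < 1e-9)
```

If the two values agree for the classification head in this repository as well, then the difference I am asking about may only be one of naming.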