zimenglan
@bailvwangzi, these are my friend's results, so i can't give you the loss or the hyper-parameters.
hi @bailvwangzi, i'm not comparing it with OHEM here; i just want to check whether `focal loss` works or not.
hi @liuchy666, i replaced the softmax loss of RCNN with focal loss.
hi @ZHAIXINGZHAIYUE, `beta` is for another form of `focal loss`, which i haven't implemented yet.
hi @jjkke88, which framework do you use?
hi @PhyscalX, here i just ignore the sign, because it cancels when multiplying by `-1 * p_i * (p_i - 1)` or `-1 * p_j * p_i`, which...
hi @PhyscalX, you are right, the loss is computed incorrectly, and thanks for reminding me about the `log` operation. i will update my code tomorrow and run some tests. for the first...
hi @PhyscalX, thanks a lot. i have fixed the problems you pointed out. for the gradient, i forgot to differentiate the `(1 - p_t)` term, to ignore the sign....
hi @PhyscalX, you're right, eps is very important; i added it to fix the `NaN` problem. it runs normally now, you can have a look. thanks for pointing...
hi @PhyscalX, i have updated the code. thanks a lot.
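for anyone following the points above (the `log`, the derivative of the `(1 - p_t)` term, and `eps`), here is a minimal pure-Python sketch of a softmax-based focal loss and its gradient for a single sample. it is not the code from this repo: the function names, the default `gamma=2.0`, and `eps=1e-10` are assumptions chosen for illustration, and the `alpha` weighting is left out.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def focal_loss(logits, target, gamma=2.0, eps=1e-10):
    """FL = -(1 - p_t)^gamma * log(p_t) for one sample.

    eps (an assumed value) clamps p_t before the log so that
    p_t == 0 does not produce -inf / NaN.
    """
    p_t = softmax(logits)[target]
    return -((1.0 - p_t) ** gamma) * math.log(max(p_t, eps))


def focal_loss_grad(logits, target, gamma=2.0, eps=1e-10):
    """Gradient of the focal loss with respect to each logit.

    Chain rule: dFL/dx_j = dFL/dp_t * dp_t/dx_j, where
      dFL/dp_t  = gamma * (1 - p_t)^(gamma - 1) * log(p_t) - (1 - p_t)^gamma / p_t
      dp_t/dx_j = p_t * (delta_tj - p_j)
    The first term of dFL/dp_t comes from differentiating (1 - p_t)^gamma.
    """
    probs = softmax(logits)
    p_t = probs[target]
    log_p_t = math.log(max(p_t, eps))
    # Contribution of the modulating factor; it tends to 0 as p_t -> 1,
    # and the guard avoids 0.0 ** negative when gamma < 1.
    if gamma > 0.0 and p_t < 1.0:
        mod = gamma * (1.0 - p_t) ** (gamma - 1.0) * p_t * log_p_t
    else:
        mod = 0.0
    coeff = mod - (1.0 - p_t) ** gamma  # equals dFL/dp_t * p_t
    return [
        coeff * ((1.0 if j == target else 0.0) - p_j)
        for j, p_j in enumerate(probs)
    ]
```

with `gamma=0` this reduces to the usual softmax cross-entropy, and comparing `focal_loss_grad` against finite differences of `focal_loss` is a quick way to catch a missing term or a wrong sign.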