nn-from-scratch
gradient checks do not match
I checked the gradients you derived against the numerical gradients, and your implementation does not match. It looks like the error is in two places:
- In `calculate_loss`, you average the total loss (including the regularization term) over the data batch. The correct implementation should average only the log loss, not the regularization term.
- In `build_model`, the gradients (`dW1`, `dW2`, `db1`, `db2`) during backprop should be averaged over the data batch. Again, the correct implementation should not include the regularization terms in the average over the data batch.
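To make the first point concrete, here is a minimal sketch of a corrected `calculate_loss`, assuming the two-layer tanh/softmax architecture from the tutorial; the names `W1`, `b1`, `W2`, `b2`, and `reg_lambda` are assumptions based on that code, not a copy of this repository's implementation:

```python
import numpy as np

def calculate_loss(model, X, y, reg_lambda=0.01):
    """Cross-entropy loss with L2 regularization.

    Sketch assuming a 2-layer tanh/softmax network with parameters
    W1, b1, W2, b2 (parameter names are assumed from the tutorial).
    """
    W1, b1, W2, b2 = model["W1"], model["b1"], model["W2"], model["b2"]
    num_examples = X.shape[0]

    # Forward pass
    z1 = X.dot(W1) + b1
    a1 = np.tanh(z1)
    z2 = a1.dot(W2) + b2
    exp_scores = np.exp(z2 - z2.max(axis=1, keepdims=True))  # stabilized softmax
    probs = exp_scores / exp_scores.sum(axis=1, keepdims=True)

    # Average ONLY the log loss over the data batch...
    data_loss = -np.mean(np.log(probs[range(num_examples), y]))
    # ...and add the regularization term WITHOUT dividing by num_examples.
    reg_loss = reg_lambda / 2 * (np.sum(np.square(W1)) + np.sum(np.square(W2)))
    return data_loss + reg_loss
```

Note that with this split, duplicating the batch leaves the loss unchanged, which is exactly what averaging only the log-loss term buys you.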
Do you have, or know of, a better implementation? Can you explain or show me how you checked it?
@uripeled2 I have a method for gradient checking in my implementation here: https://github.com/vuptran/introduction-to-neural-networks
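For reference, the standard way such a check works is central-difference numerical differentiation compared against the analytic gradients via a relative error. This is a generic sketch of that technique, not the implementation from the linked repository:

```python
import numpy as np

def numerical_gradient(f, theta, eps=1e-5):
    """Central-difference gradient of a scalar function f at array theta.

    f is a zero-argument callable that reads theta; each entry of theta
    is perturbed in place by +/- eps and then restored.
    """
    grad = np.zeros_like(theta)
    it = np.nditer(theta, flags=["multi_index"])
    while not it.finished:
        idx = it.multi_index
        orig = theta[idx]
        theta[idx] = orig + eps
        f_plus = f()
        theta[idx] = orig - eps
        f_minus = f()
        theta[idx] = orig  # restore the original value
        grad[idx] = (f_plus - f_minus) / (2 * eps)
        it.iternext()
    return grad

def relative_error(analytic, numeric):
    """Standard gradient-check metric; values below ~1e-6 indicate a match."""
    num = np.abs(analytic - numeric)
    den = np.maximum(np.abs(analytic) + np.abs(numeric), 1e-12)
    return np.max(num / den)
```

Running this against each of `dW1`, `dW2`, `db1`, `db2` (with `f` set to the loss on a fixed batch) is what reveals the averaging mismatch described above.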