Rahim Shamsy

Results: 1 issue by Rahim Shamsy

When computing gradients, the code does not calculate the gradient of the softmax function using the formula derived in the lecture notes. It seems that in the code, this step...
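For context, here is a minimal sketch of one common reason code can look different from a derived softmax gradient. It assumes (this is not stated in the issue) that the softmax output feeds a cross-entropy loss with a one-hot target. In that case, chaining the loss gradient through the full softmax Jacobian gives the same result as the shortcut `s - y`, so an implementation may skip the explicit Jacobian. All function names below are hypothetical and are not taken from the code or lecture notes under discussion.

```python
import math


def softmax(z):
    # Subtract the max logit for numerical stability; the result is unchanged.
    m = max(z)
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [e / total for e in exps]


def softmax_jacobian(z):
    # Standard softmax Jacobian: ds_i/dz_j = s_i * (delta_ij - s_j).
    s = softmax(z)
    n = len(s)
    return [
        [s[i] * ((1.0 if i == j else 0.0) - s[j]) for j in range(n)]
        for i in range(n)
    ]


def cross_entropy_grad_via_jacobian(z, y):
    # Chain rule: dL/dz_j = sum_i (dL/ds_i) * (ds_i/dz_j),
    # where L = -sum_i y_i * log(s_i), so dL/ds_i = -y_i / s_i.
    s = softmax(z)
    jac = softmax_jacobian(z)
    n = len(z)
    return [sum(-y[i] / s[i] * jac[i][j] for i in range(n)) for j in range(n)]


def cross_entropy_grad_shortcut(z, y):
    # Simplified form, valid when the entries of y sum to 1: dL/dz = s - y.
    s = softmax(z)
    return [si - yi for si, yi in zip(s, y)]


if __name__ == "__main__":
    logits = [1.0, 2.0, 0.5]   # illustrative values only
    target = [0.0, 1.0, 0.0]   # one-hot target (an assumption of this sketch)
    print(cross_entropy_grad_via_jacobian(logits, target))
    print(cross_entropy_grad_shortcut(logits, target))
```

If the two printed vectors agree up to floating-point error, the code and the derived formula are computing the same gradient in different forms. Whether that is what happens in the code this issue refers to cannot be determined from the excerpt.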