cs231n icon indicating copy to clipboard operation
cs231n copied to clipboard

Shortest solutions for CS231n 2021-2024

Results 4 cs231n issues
Sort by recently updated
recently updated
newest added

https://github.com/naya0000/cs231n/blob/e1192dc8cbaf078c3cfb691e12b8d6d2ec40c8fa/assignment1/cs231n/classifiers/linear_svm.py#L110 Can someone explain why this subtraction is done? An explanation for derivative calculation.

I am learning the Transformer_Captioning.ipynb in assignment3. After I run the cell of testing MultiHeadAttention, I get some incorrect results: ``` self_attn_output error: 0.449382070034207 masked_self_attn_output error: 1.0 attn_output error: 1.0...

Here is a partial derivative jacobian matrix for softmax: ![image](https://github.com/mantasu/cs231n/assets/56772228/266874b5-b36f-49cb-b1cd-4bbca13a833a) This simplifies to: ![image](https://github.com/mantasu/cs231n/assets/56772228/b203da3b-93b6-43f3-8d93-7967c223bcbc) ``` softmax[y[i]] -= 1 # update for gradient ``` Didn't get this? Can someone explain? For...

question

I have encounterd isues when tring to run the solver in default learning rate 1e-2, when looking for other people's answers, I notice that everyone basically change it to 1e-3...