neural-network-from-scratch
Implementing Multiple Layer Neural Network from Scratch
In the backpropagation part, the first line of code reads: dtanh = softmaxOutput.diff(forward[len(forward)-1][2], y). So the last layer's output is passed through the activation and then sent to the softmax? I guess for the last layer there is no...
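If I read it right, the structure is roughly like this. This is only a sketch with my own helper names, assuming forward[i] stores (mul, add, activated) for layer i and that softmaxOutput.diff returns the cross-entropy gradient w.r.t. whatever is fed into the softmax:

    import numpy as np

    def softmax_diff(scores, y):
        # gradient of softmax + cross-entropy loss w.r.t. its input "scores"
        probs = np.exp(scores - scores.max(axis=1, keepdims=True))
        probs /= probs.sum(axis=1, keepdims=True)
        probs[np.arange(len(y)), y] -= 1.0
        return probs

    def forward_pass(X, W, b):
        # forward[i] = (mul, add, activated) for layer i; forward[0] holds the input
        forward = [(None, None, X)]
        inp = X
        for Wi, bi in zip(W, b):
            mul = inp.dot(Wi)      # MultiplyGate.forward
            add = mul + bi         # AddGate.forward
            inp = np.tanh(add)     # Tanh.forward
            forward.append((mul, add, inp))
        return forward

    # The first backprop line then differentiates the loss w.r.t. the *activated*
    # output of the last layer, i.e. exactly what was fed into the softmax:
    # dtanh = softmax_diff(forward[len(forward)-1][2], y)

So forward[len(forward)-1][2] would be the tanh output of the last layer, and the softmax sits on top of that activated output.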
I think this is right:

    def backward(self, X, top_diff):
        output = self.forward(X)
        return (1.0 - np.square(output)) * top_diff

because (tanh x)' = sech²x = 1 - tanh²x.
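A quick numerical gradient check makes the sign easy to verify. This is just a sketch with a standalone Tanh class using the forward/backward above:

    import numpy as np

    class Tanh:
        def forward(self, X):
            return np.tanh(X)

        def backward(self, X, top_diff):
            output = self.forward(X)
            return (1.0 - np.square(output)) * top_diff

    layer = Tanh()
    X = np.random.randn(3, 4)
    eps = 1e-6
    analytic = layer.backward(X, np.ones_like(X))
    # central-difference estimate of d tanh(x) / dx
    numeric = (layer.forward(X + eps) - layer.forward(X - eps)) / (2 * eps)
    print(np.allclose(analytic, numeric))  # True with 1 - tanh^2, False with 1 + tanh^2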
Very confusing. I have searched a lot about the BP algorithm; some notes say it is enough to differentiate only w.r.t. W (the parameters) and use the residual (error term) to get the gradient? Your example seems...
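If it helps, the "residual" view and the "differentiate w.r.t. W" view are usually the same calculation: the residual (delta) is dL/d(pre-activation), the gradient w.r.t. W at each layer is input^T · delta, and the delta itself is what gets passed backward. A rough sketch of one layer's backward step, with my own variable names rather than the article's:

    import numpy as np

    def layer_backward(W, b, layer_input, layer_preact, delta_out):
        # delta_out: dL/d(activation output) of this layer
        delta = delta_out * (1.0 - np.tanh(layer_preact) ** 2)  # back through tanh
        dW = layer_input.T.dot(delta)   # gradient w.r.t. W, built from the residual
        db = delta.sum(axis=0)          # gradient w.r.t. b
        delta_in = delta.dot(W.T)       # residual handed to the previous layer
        return dW, db, delta_in

So differentiating w.r.t. W and "using the residual" are not two different algorithms; the residual is just the intermediate quantity that both the parameter gradients and the next backward step are computed from.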