juice icon indicating copy to clipboard operation
juice copied to clipboard

Implement dropout

Open drahnr opened this issue 6 years ago • 4 comments

Should be pretty straight forward, warmup for #10

  • [x] expand the cudnn bindings in rcudnn
  • [x] use the rcudnn bindings in coaster-nn
  • [x] create a apropriate interface in coaster
  • [x] use that interface to define a layer in juice
  • [ ] implement tests

Paper: http://www.cs.toronto.edu/~rsalakhu/papers/srivastava14a.pdf

drahnr avatar Jul 27 '17 19:07 drahnr

Why backprop of it commented out?

andreytkachenko avatar Jun 18 '19 08:06 andreytkachenko

If I read the paper correctly, the backpropagation is just a unit factor which can be skipped. I am on my phone so I cannot review the code right now, the backprop will skip all non existent elements during backprop which enables a good speedup IIRC.

drahnr avatar Jun 18 '19 16:06 drahnr

Actually that is incorrect, backprop should only propagate back on the thinned network (section 5.1 of the linked paper) so unless the weights are zero, backprop may not be skipped

drahnr avatar Jun 18 '19 21:06 drahnr

Reviewing the paper, the thinned paper essentially is setting the gradient to zero which is easily done. The normalization should be realized by means of an additional mechanic or variation parameter which can be introduced in a separate PR.

drahnr avatar Jan 02 '20 03:01 drahnr