neuron
neuron copied to clipboard
Stochastic Gradient Training with mini-batch mode
SGD is hard to use ...
Ref: http://yaroslavvb.blogspot.com/2014/03/stochastic-gradient-methods-2014.html
SGD with momentum works well.