sgd topic
gradient-descent
A research project on enhancing gradient optimization methods
Crowded-Valley---Results
This repository contains the results for the paper "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers".
sgdtk
A Java library for Stochastic Gradient Descent (SGD)
dpwa
Distributed Learning by Pair-Wise Averaging
awd-lstm-lm
LSTM and QRNN Language Model Toolkit for PyTorch
LightNet
Efficient, transparent deep learning in hundreds of lines of code.
nfnets-pytorch
NFNets and Adaptive Gradient Clipping for SGD, implemented in PyTorch. An explanation is available at tourdeml.github.io/blog/
terngrad
Ternary Gradients to Reduce Communication in Distributed Deep Learning (TensorFlow)
SGDLibrary
MATLAB/Octave library for stochastic optimization algorithms: Version 1.0.20