RAdam-Tensorflow
RAdam-Tensorflow copied to clipboard

Published 20 hours ago •

→

Simple Tensorflow implementation of "On The Variance Of The Adaptive Learning Rate And Beyond"

RAdam-Tensorflow

from RAdam import RAdamOptimizer

train_op = RAdamOptimizer(learning_rate=0.001, beta1=0.9, beta2=0.999, weight_decay=0.0).minimize(loss)

result

Simple Tensorflow implementation of "On The Variance Of The Adaptive Learning Rate And Beyond"

Stars

Forks

Watchers

Stars

Forks

Watchers

Simple Tensorflow implementation of "On The Variance Of The Adaptive Learning Rate And Beyond"