pytorch-cosine-annealing-with-warmup
pytorch-cosine-annealing-with-warmup copied to clipboard
Allow `max_lr` to be set per group
From #11.
Issue: if you have an optimizer with multiple groups with different learning rates, all of their learning rate values will get overridden by the max_lr
.