pytorch-cosine-annealing-with-warmup
Warmup steps only apply on the first cycle
From #11.
Currently, warmup is applied in every cycle. With this change, an argument makes it possible to apply warmup only in the first cycle. Similar to here.
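
To illustrate the intended behavior, here is a minimal standalone sketch. It is not the code in this repository: the function `lr_at_step` and the flag `warmup_first_cycle_only` are hypothetical names chosen for the example. It also assumes a fixed cycle length, linear warmup from `min_lr` to `max_lr`, and `warmup_steps < cycle_steps`.

```python
import math


def lr_at_step(step, cycle_steps, warmup_steps, max_lr, min_lr,
               warmup_first_cycle_only=True):
    """Learning rate for a cosine schedule with restarts (illustrative only).

    `warmup_first_cycle_only` is a hypothetical flag name, not the
    argument actually used by the library.
    """
    cycle = step // cycle_steps   # index of the current cycle
    pos = step % cycle_steps      # position inside the current cycle

    # Skip warmup in later cycles when the flag is set.
    if warmup_first_cycle_only and cycle > 0:
        warmup = 0
    else:
        warmup = warmup_steps

    if pos < warmup:
        # Linear warmup from min_lr towards max_lr.
        return min_lr + (max_lr - min_lr) * pos / warmup

    # Cosine decay from max_lr to min_lr over the rest of the cycle.
    progress = (pos - warmup) / (cycle_steps - warmup)
    return min_lr + 0.5 * (max_lr - min_lr) * (1 + math.cos(math.pi * progress))
```

With the flag set, the second cycle starts directly at `max_lr`; with it unset, every cycle ramps up from `min_lr` again, which matches the current behavior described above.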