OctoberKat
Results
2
comments of
OctoberKat
I wonder if I can load the model parameters to GPU correctly by simply write: ``` model = EfficientNet.from_pretrained('efficientnet-b3') model.cuda() ```
Maybe you should try warmup learning rate sceduler? Transformer is particularly sensitive to learning rate scheme.