OctoberKat comments

Repositories
Issues
Comments

Results 2 comments of


                                            OctoberKat

Consider if user has GPU/CPU while calling `torch.load()`

I wonder if I can load the model parameters to GPU correctly by simply write: ``` model = EfficientNet.from_pretrained('efficientnet-b3') model.cuda() ```

network can't train when incorporate this

Maybe you should try warmup learning rate sceduler? Transformer is particularly sensitive to learning rate scheme.