pal
pal copied to clipboard
Fine-tune What the parameters look like
If I don't change the @torch.no_grad() in the code, I can't run it and get an error that the loss forward propagation cannot be done. If I comment out @torch.no_grad(), there is not enough video memory. I would like to ask what the code looks like and what are the startup parameters for model fine-tuning?