Transformers-Tutorials
Transformers-Tutorials copied to clipboard
Donut training slowness
Hi, I am currently working on fine-tuning the Donut model on custom dataset. While I train on only 2500 samples, with 30 epoch it takes around 20 hrs on 24GB GPU.
How can we speed up this process?
@NielsRogge
Hi, Any response would be highly appreciated.
Which batch size are you using? Can you verify the GPU is used (by typing nvidia-smi in the terminal)?
I'd recommend this guide: https://huggingface.co/docs/transformers/perf_train_gpu_one
Hi, Any response would be highly appreciated.
I'm guessing you were training with a RTX 4090? Did you manage to get your training times down?