How to fine-tune with the lower-memory fp16 version on P100 GPUs?

Open xurongqiang opened this issue 6 years ago • 5 comments

Fine-tuning with the fp32 version runs out of memory (OOM). How should I modify the training.py script to fine-tune with the lower-memory fp16 version instead?

xurongqiang • Oct 09 '19 10:10

This is quite delicate and doesn't quite seem to work out of the box. I'm going to need more time to look into this.

keskarnitish • Oct 15 '19 17:10

> This is quite delicate and doesn't quite seem to work out of the box. I'm going to need more time to look into this.

For background: we have a large number of P100 machines, but they cannot run the fp32 version. Thank you for working on this.

xurongqiang • Oct 16 '19 03:10

Is there any update on this? We have the same problem, unfortunately.

v1nc3nt27 • Feb 05 '20 17:02

Yes, same here. Both the huggingface version and this repo hit the same OOM error when running on a free Google Colab GPU such as the P100. Is there any fix or workaround yet?

itsuncheng • Feb 11 '20 06:02

The problem still persists, unfortunately. Fine-tuning doesn't really work with Colab resources.

nastyatrvl • Jan 02 '22 18:01