GPU Memory

Open sherylxun opened this issue 2 years ago • 1 comment

How much GPU memory is required for model training? If the GPU memory is not enough, how can model parameters be optimized (batch_size has been set to 1)?

sherylxun, Feb 20 '23 11:02

I was able to start training a t5_large-based ere model with the flant5_large_conll04 config. The only change I made was setting use_amp to false, because the V100 does not work with bf16. Training requires 25870 MiB of GPU memory.

Niklss, Jun 04 '23 21:06
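
For a rough sense of what the reported 25870 MiB means in practice, the sketch below converts it to GiB and compares it against nominal card capacities. The helper names (`mib_to_gib`, `fits`) and the `headroom_mib` parameter are hypothetical, introduced only for illustration. The 16 GB and 32 GB figures are the nominal V100 variants; the usable memory reported by the driver is somewhat lower, and the figure applies only to the setup described above (use_amp disabled).

```python
# Rough capacity check for the memory figure reported in this thread.
# Helper names and the headroom parameter are illustrative assumptions.

MIB_PER_GIB = 1024


def mib_to_gib(mib: float) -> float:
    """Convert mebibytes to gibibytes."""
    return mib / MIB_PER_GIB


def fits(required_mib: float, capacity_gib: float, headroom_mib: float = 0) -> bool:
    """Return True if the requirement plus optional headroom fits the nominal capacity."""
    return required_mib + headroom_mib <= capacity_gib * MIB_PER_GIB


REPORTED_MIB = 25870  # figure reported above for t5_large with use_amp set to false

if __name__ == "__main__":
    print(f"Reported usage: {mib_to_gib(REPORTED_MIB):.2f} GiB")
    # Nominal V100 variants; actual usable memory is slightly lower.
    for capacity_gib in (16, 32):
        verdict = "fits" if fits(REPORTED_MIB, capacity_gib) else "does not fit"
        print(f"{capacity_gib} GiB card: {verdict}")
```

By this estimate the reported configuration needs a card in the 32 GB class; a 16 GB card would fall short even with batch_size set to 1.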