ASP
GPU Memory
How much GPU memory is required for model training? If the GPU memory is not enough, how can the model parameters be adjusted to reduce memory usage (batch_size has already been set to 1)?
I've been able to start training the t5_large-based ERE model with the flant5_large_conll04 config. The only change I made was setting use_amp to false, because the V100 does not work with bf16. Training requires 25870MiB of GPU memory.
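
For a rough sense of where a number of that size comes from, here is a back-of-the-envelope sketch. It assumes about 770M parameters for t5_large, fp32 weights and gradients (since use_amp is off), and an Adam-style optimizer keeping two fp32 states per parameter. None of these values are read from the ASP config, and the function name is only illustrative. Activations, task-specific heads, and CUDA overhead are not counted, so the result is a lower bound.

```python
def estimate_static_training_memory_mib(
    num_params,
    bytes_per_weight=4,     # fp32 weights (assumption: no mixed precision)
    bytes_per_grad=4,       # fp32 gradients
    bytes_optimizer_state=8 # assumption: Adam-style, two fp32 states per parameter
):
    """Lower-bound estimate in MiB: weights + gradients + optimizer state.

    Activations and framework overhead are deliberately excluded.
    """
    bytes_per_param = bytes_per_weight + bytes_per_grad + bytes_optimizer_state
    return num_params * bytes_per_param / (1024 ** 2)


# Assumption: approximate parameter count for t5_large.
T5_LARGE_PARAMS = 770_000_000

static_mib = estimate_static_training_memory_mib(T5_LARGE_PARAMS)
reported_mib = 25870  # figure reported in this thread

print(f"static (weights + grads + optimizer): ~{static_mib:.0f} MiB")
print(f"left for activations and overhead:    ~{reported_mib - static_mib:.0f} MiB")
```

Under these assumptions, a bit less than half of the reported usage is fixed cost that does not shrink with batch_size; the rest would be activations and overhead, which depend on sequence length and model-specific settings.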