best-rq-pytorch

What are the GPU memory specs needed to run pretraining and kmeans?

Open: xanguera opened this issue 1 year ago • 1 comment

Hi, I am trying to run the pretraining of the full model (which should have ~650M parameters) on a 24 GB GPU, and it only runs if I set the batch size to 1, which is useless for training. How much memory is needed to run the full training with the preset batch size? Also, once training finished, I tried to run the k-means fitting script, and it seems to require even more memory. Any idea how much memory that step needs?

Thanks!

xanguera, Dec 13 '23 14:12
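
For context on the numbers in the question, here is a rough back-of-the-envelope sketch of the memory that weights, gradients, and optimizer states alone would take for a ~650M-parameter model. It assumes fp32 training and an Adam-style optimizer that keeps two extra states per parameter; neither assumption is confirmed for this repository, and the function name is hypothetical. Activations, which grow with batch size and sequence length, are not included and usually dominate.

```python
def estimate_static_training_memory_gib(num_params,
                                        bytes_per_value=4,      # assumption: fp32
                                        optimizer_states=2):    # assumption: Adam-style (two moments)
    """Estimate memory for weights + gradients + optimizer states, in GiB.

    Activation memory is deliberately excluded; it depends on batch size,
    sequence length, and model architecture.
    """
    copies_per_param = 1 + 1 + optimizer_states  # weights, gradients, optimizer states
    total_bytes = num_params * bytes_per_value * copies_per_param
    return total_bytes / 2**30


# ~650M parameters, as stated in the question
static_gib = estimate_static_training_memory_gib(650_000_000)
print(f"Static training memory (no activations): {static_gib:.1f} GiB")
```

Under these assumptions, a bit under 10 GiB of a 24 GB card is taken before any activations are stored, which would be consistent with only very small batches fitting. Mixed precision, gradient checkpointing, or gradient accumulation are common ways to trade compute for memory, but whether they apply here depends on the training script.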