gpt2-dialogue-generation-pytorch icon indicating copy to clipboard operation
gpt2-dialogue-generation-pytorch copied to clipboard

Running on Colab takes 200+ days on GPU

Open oliverbob opened this issue 2 years ago • 1 comments
trafficstars

Hi,

First, I thank you for providing a working repo.

I tried your code on Google Colab, but it seems to take almost a year to train the model on STANDARD (NVIDIA T4 Tensor Core) GPU. Is there a way we can test this small scale? Perhaps even just a CPU?

Thank you.

oliverbob avatar Feb 07 '23 12:02 oliverbob

Hi, thanks for noticing me! I'm not sure why it takes that long because I had no problem when I tried on a basic GPU model (I don't remember the exact model). If it takes almost a year, I don't think this is a problem of GPU. Check if you are using the GPU runtime correctly first, and if there is no problem, let me see your code cells of your colab environment. Thanks.

devjwsong avatar Feb 11 '23 21:02 devjwsong