gpt2-dialogue-generation-pytorch
Running on Colab takes 200+ days on GPU
Hi,
First, thank you for providing a working repo.
I tried your code on Google Colab, but training the model seems to take almost a year on the standard GPU (NVIDIA T4 Tensor Core). Is there a way to test this on a smaller scale? Perhaps even on just a CPU?
Thank you.
Hi, thanks for letting me know! I'm not sure why it takes that long, because I had no problem when I trained on a basic GPU (I don't remember the exact model). If it takes almost a year, I don't think the GPU itself is the problem. First check that you are actually using the GPU runtime, and if that looks fine, show me the code cells from your Colab environment. Thanks.
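For the first check, here is a minimal sketch that can be pasted into a Colab cell to see which device PyTorch can use. It is not taken from the repo: the helper name `describe_runtime` is hypothetical, and it only reports what PyTorch sees, not which device the training script actually selects.

```python
def describe_runtime():
    """Return a short description of the device PyTorch can see.

    Hypothetical helper for illustration; not part of the repo.
    """
    try:
        import torch
    except ImportError:
        return "PyTorch is not installed"

    # torch.cuda.is_available() is True only when a CUDA GPU is visible
    # to this Python process.
    if torch.cuda.is_available():
        return f"GPU: {torch.cuda.get_device_name(0)}"
    return "CPU only (no CUDA device visible)"


print(describe_runtime())
```

If this reports "CPU only" on Colab, the notebook is probably running on a CPU runtime; the hardware accelerator can be changed under Runtime > Change runtime type. If it does report a GPU, the next thing to look at is whether the training code actually moves the model and batches to that device.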