lvxiaochun

Results 2 comments of lvxiaochun

just curious whether this still support to train GPT-Neo from scratch? like GPT2 in aitextgen does. specifically can it be trained on a nvidia GPU with 8G memory (like 3060Ti)

HI: thanks for the reply 1. actually I can't get it run on CPU model : param_applied = fn(param) File "/home/harry/.local/lib/python3.8/site-packages/torch/nn/modules/module.py", line 471, in return self._apply(lambda t: t.cpu()) RuntimeError: CUDA...