transformer-xl icon indicating copy to clipboard operation
transformer-xl copied to clipboard

run pytorch’s run_wt103_large.sh print 285170506 parameters, but the paper is 128M, and OOM.

Open guotong1988 opened this issue 5 years ago • 1 comments

RuntimeError: CUDA out of memory.

My GPU is 11441MiB.

How to reproduce 128M-model?

Thank you @kimiyoung @zihangdai

guotong1988 avatar Mar 11 '19 01:03 guotong1988

Same problem here, even with a Titan V 32G x 8 system, I run into the OOM problem.

torshie avatar Oct 09 '19 08:10 torshie