nanoGPT
nanoGPT copied to clipboard
Using float16 via Gradscaler
https://huggingface.co/facebook/opt-30b
Larger open source model? Does it work?
It seems OPT uses float16. https://github.com/karpathy/nanoGPT/blob/3e0fd425794e3e0160ae6132916beff78823888a/train.py#L85-L88
How do I use gradscaler?