nanoGPT
How to train nanoGPT using TPUs?
How can I train nanoGPT using TPUs? Can I just modify the DDP setup to target TPU VMs, or do I need to change the model to make it XLA-compilable?
You need to use PyTorch/XLA (`torch_xla`) for that, or reimplement it in JAX.
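For the PyTorch/XLA route, here is a rough sketch of what a single-device training loop can look like. It is not nanoGPT's `train.py`: the `torch.nn.Linear` model and random tensors are stand-ins, and the helper names (`xla_available`, `pick_device_name`, `run_tiny_training`) are made up for illustration. It assumes `torch` is installed, uses `torch_xla` only if it is found, and falls back to CPU otherwise. `xm.xla_device()` and `xm.mark_step()` are the long-standing `torch_xla` calls; newer releases also offer other spellings, so check the version you have.

```python
import importlib.util


def xla_available() -> bool:
    """True when the torch_xla package is importable in this environment."""
    return importlib.util.find_spec("torch_xla") is not None


def pick_device_name() -> str:
    """Return "xla", "cpu", or "none" (the last when torch itself is missing)."""
    if importlib.util.find_spec("torch") is None:
        return "none"
    return "xla" if xla_available() else "cpu"


def run_tiny_training(steps: int = 3) -> float:
    """Train a stand-in linear model for a few steps and return the last loss."""
    import torch

    if xla_available():
        import torch_xla.core.xla_model as xm
        device = xm.xla_device()  # the XLA device (a TPU core on a TPU VM)
    else:
        xm = None
        device = torch.device("cpu")  # fallback so the sketch still runs

    # Stand-in model and data; nanoGPT's GPT module would take their place.
    model = torch.nn.Linear(8, 1).to(device)
    optimizer = torch.optim.AdamW(model.parameters(), lr=1e-2)
    x = torch.randn(16, 8, device=device)
    y = torch.randn(16, 1, device=device)

    for _ in range(steps):
        optimizer.zero_grad(set_to_none=True)
        loss = torch.nn.functional.mse_loss(model(x), y)
        loss.backward()
        optimizer.step()
        if xm is not None:
            # XLA traces ops lazily; mark_step() compiles and runs the step.
            xm.mark_step()

    return loss.item()


if __name__ == "__main__":
    if pick_device_name() == "none":
        print("torch is not installed; nothing to run")
    else:
        print(pick_device_name(), run_tiny_training())
```

In this sketch the changes are all in the training loop (device selection and the explicit step marker), not in the model definition. Multi-core TPU training would need a process launcher and gradient all-reduce on top of this, which are left out here.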