nanoGPT
How fast is nanoGPT compared to Megatron, DeepSpeed, and similar frameworks?
Those frameworks say they support pipeline parallelism, tensor parallelism, data parallelism, and similar techniques. Can nanoGPT be faster than frameworks that use these kinds of parallelism?