Study and incorporate Andrej Karpathy's `llm.c` lessons

Open lukstafi opened this issue 1 year ago • 3 comments

"A few new CUDA hacker friends joined the effort and now llm.c is only 2X slower than PyTorch"

https://github.com/karpathy/llm.c

lukstafi, Apr 13 '24 07:04

https://twitter.com/karpathy/status/1779354343013269929

lukstafi, Apr 14 '24 09:04

llm.c has achieved parity with PyTorch FP32: https://twitter.com/karpathy/status/1781387674978533427

lukstafi, Apr 19 '24 21:04

The "study" part is targeted at the 0.6.x versions, but many of the resulting solutions will wait until 0.9.x.

lukstafi, Sep 20 '24 11:09