open_lm
Fused RMSNorm
Adding a Triton kernel for RMSNorm and testing it.
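For context, here is a minimal unfused reference for the computation a fused RMSNorm kernel has to reproduce, written in plain Python so it runs anywhere. It is a sketch, not the Triton kernel from this PR: the names `rms_norm_reference` and `max_abs_diff` and the default `eps=1e-6` are assumptions made for illustration. A reference like this is typically what a fused kernel's output gets compared against in a test.

```python
import math


def rms_norm_reference(x, weight, eps=1e-6):
    """Unfused RMSNorm over one row: y_i = x_i / sqrt(mean(x^2) + eps) * w_i.

    `eps` is an assumed default for this sketch, not a value taken from the PR.
    """
    if len(x) != len(weight):
        raise ValueError("x and weight must have the same length")
    # Mean of squares over the normalized dimension.
    mean_sq = sum(v * v for v in x) / len(x)
    # Reciprocal root-mean-square, computed once per row.
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    # Normalize, then apply the learned per-element scale.
    return [v * inv_rms * w for v, w in zip(x, weight)]


def max_abs_diff(a, b):
    """Largest elementwise absolute difference between two equal-length rows."""
    return max(abs(p - q) for p, q in zip(a, b))
```

A test for a fused kernel could run both implementations on the same input and assert that `max_abs_diff` stays below a tolerance suited to the dtype in use.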
#3
The next step is to test this on a multi-node setup. I'll look into that after Nov 17.