triton
triton copied to clipboard
[TUTORIAL] Minor update and clean up the tutorial05
- Based on the formula and the actual computation code, the
B
andeps
are not used indef _layer_norm_bwd_dx_fused
and are thus removed for clarity. - Some other minor clean-ups.
Changes have tested on GPU, execution parity.