TransformerEngine
TransformerEngine copied to clipboard
[Paddle] Add main_grad
Support main_grad and fuse_wgrad_accumulation
/te-ci paddle
/te-ci paddle
/te-ci paddle
LGTM