DeepSpeed
DeepSpeed copied to clipboard
Transformer/fix layer norm
This PR addresses https://github.com/microsoft/DeepSpeed/issues/581
Can one of the admins verify this patch?
Hi, was this issue fixed? LayerNorm combined with DeepSpeed FP16 still seems to be problematic.