Megatron-DeepSpeed

Test different layer norm

Open: thomasw21 opened this issue 4 years ago • 0 comments

Script to reproduce diverging layer_norm weights

thomasw21, Mar 24 '22 13:03