Puyuan Liu
Puyuan Liu
My rewards seems even decrasing, despite the decrease in loss   
> @puyuanOT OK i got the solution. Try to disable the hybirdengine, this make the model always repeat 'a a a a a' not sure the reason. Thanks a lot!...
Perhaps it's related to this PR https://github.com/microsoft/DeepSpeedExamples/pull/470?
Looking forward to any replies. This resizing not only leads to the warning `You are resizing the embedding layer without providing a `pad_to_multiple_of` parameter. This means that the new embedding...