Mayank Mishra
Mayank Mishra
Ayo!!! @stas00 https://github.com/microsoft/DeepSpeed/pull/2995
Thanks @stas00 Ill test this out sometime today :)
@ShijieZZZZ fyi @tjruwase
[global_step4000.zip](https://github.com/microsoft/DeepSpeed/files/11266342/global_step4000.zip) This checkpoint fails to load. The current fix works. But not sure if there is a better solution :)
@tjruwase do you think there can be a better solution here though? the variable `shared_params` contains a lot of variables which are not actually shared because of the logic by...
no worries, lets work on a better solution as soon as possible though. :)
> > [global_step4000.zip](https://github.com/microsoft/DeepSpeed/files/11266342/global_step4000.zip) This checkpoint fails to load. The current fix works. But not sure if there is a better solution :) > > Can you share the stack trace...
@tjruwase I don't think I can share anything more than the checkpoints as zip: https://github.com/microsoft/DeepSpeed/pull/3295#issuecomment-1513800129 You should be able to easily reproduce with this.
yes @tjruwase thanks. Sorry for the late response. Closing this
lets merge this?