Sam Vance
Results
3
comments of
Sam Vance
I am having the same issue, except in the debug, GOT UDP! shows up in lists of 10 or more
I was able to get the model to run by first converting the weights to deepspeed checkpoints, and then loading the model from those checkpoints. I set deepspeed strategy as...
@HeorhiiS It was a full 7B model. Note that it trained slower than the normal model.