DeepSpeed icon indicating copy to clipboard operation
DeepSpeed copied to clipboard

FusedAdam Hangs during Initialization

Open lipiji opened this issue 1 year ago • 1 comments

I am testing the accelerate training code and I modify the optimizer of: https://github.com/huggingface/accelerate/blob/main/examples/nlp_example.py#L141 to the FusedAdam (from deepspeed.ops.adam import FusedAdam), then after "python ./nlp_example.py" I find that the progress hands on the line of "optimizer = FusedAdam(params=model.parameters(), lr=lr)".

What are the possible reasons? https://github.com/huggingface/accelerate/tree/main/examples

lipiji avatar May 03 '23 04:05 lipiji

Hi @lipiji I was unable to reproduce the bug. Could you provide information about the environment setup you are using?

jomayeri avatar May 04 '23 19:05 jomayeri

Closing now. Feel free to reopen if needed.

jomayeri avatar May 11 '23 22:05 jomayeri