Cheng Li
Hi @lucasleesw, thanks for the PR. Is there a reason you use __str__ instead of __str__()?
ZeRO inference currently takes the general deepspeed config json path. Do we want to expose DS-inference config in the deepspeed config file as well?
@LM-AuroTripathy, thanks for pointing this out. This will be fixed by https://github.com/microsoft/DeepSpeed/pull/2443.
Hi @GongCQ and @insujang, this is expected if start_profile and end_profile are not set correctly. The profiler is supposed to be started and ended on a single training step (in...
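To illustrate the single-step pattern, here is a minimal sketch. It assumes a profiler object exposing start_profile, stop_profile, print_model_profile and end_profile (the method names used by DeepSpeed's FlopsProfiler). The `RecordingProfiler` class, `train_with_profile` and `train_step` are hypothetical stand-ins so the snippet runs without DeepSpeed installed; they are not part of the library.

```python
class RecordingProfiler:
    """Hypothetical stand-in that only records which methods were called."""

    def __init__(self):
        self.calls = []

    def start_profile(self):
        self.calls.append("start_profile")

    def stop_profile(self):
        self.calls.append("stop_profile")

    def print_model_profile(self, profile_step=1):
        self.calls.append(f"print_model_profile({profile_step})")

    def end_profile(self):
        self.calls.append("end_profile")


def train_with_profile(profiler, train_step, num_steps, profile_step):
    """Run num_steps steps, starting and ending the profiler on one step only."""
    for step in range(num_steps):
        profiling = step == profile_step
        if profiling:
            profiler.start_profile()  # begin collecting right before the step
        train_step(step)
        if profiling:
            profiler.stop_profile()  # stop collecting right after the same step
            profiler.print_model_profile(profile_step=profile_step)
            profiler.end_profile()  # clean up so later steps are not profiled


if __name__ == "__main__":
    prof = RecordingProfiler()
    steps_run = []
    train_with_profile(prof, steps_run.append, num_steps=5, profile_step=2)
    print(prof.calls)
```

With a real model, the stand-in would be replaced by the actual profiler instance; the point is only that start and end bracket the same step.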
@rahilbathwal5, thanks for the PR!
Thanks for the PR.
@clumsy, sorry for the late response, I was out of office for a while. Can you please resolve the conflicts with the main branch? Thanks.
> Sure @cli99, are you in favor of the proposed changes though? Now I also need to find a way to make it reflect the expert parameters. Most changes look...
Hi @yerimChoi, the flops profiler does not support the DeepSpeed pipeline parallel engine yet. If you use pp_size > 1, disable the flops profiler.
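One way to apply this is to switch the profiler off in the config before it is handed to DeepSpeed. This is a rough sketch, assuming the config is a Python dict with the usual "flops_profiler" section and its "enabled" key; the helper name `disable_flops_profiler_for_pipeline` and the `pp_size` argument are made up for illustration.

```python
import copy


def disable_flops_profiler_for_pipeline(ds_config, pp_size):
    """Return a copy of ds_config with the flops profiler off when pp_size > 1."""
    config = copy.deepcopy(ds_config)  # leave the caller's dict untouched
    if pp_size > 1:
        # Create the section if it is missing, then force it off.
        config.setdefault("flops_profiler", {})["enabled"] = False
    return config


if __name__ == "__main__":
    base = {
        "train_batch_size": 8,
        "flops_profiler": {"enabled": True, "profile_step": 1},
    }
    print(disable_flops_profiler_for_pipeline(base, pp_size=2))
```

With pp_size equal to 1 the config is returned unchanged, so the same helper can be used for both pipeline and non-pipeline runs.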
Hi @gauravmunjal13, thanks for pointing out this issue. It should be fixed by https://github.com/microsoft/DeepSpeed/pull/3782.