Cheng Li

Results 26 comments of Cheng Li

Hi @lucasleesw , thanks for the PR. Is there a reason you use __str__ instead of __str__()?

ZeRO inference currently takes the general DeepSpeed config JSON path. Do we want to expose the DS-inference config in the DeepSpeed config file as well?

@LM-AuroTripathy , thanks for pointing this out. This will be fixed by https://github.com/microsoft/DeepSpeed/pull/2443.

Hi @GongCQ and @insujang, this is expected if start_profile and end_profile are not set correctly. The profiler is supposed to be started and ended on a single training step (in...
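
To illustrate the point about starting and ending the profiler on a single training step, here is a minimal sketch. `StubProfiler`, `train`, and `profile_step` are hypothetical names used only for illustration; the stub merely records calls and is not the DeepSpeed profiler class. Only the method names `start_profile` and `end_profile` come from the comment above.

```python
class StubProfiler:
    """Hypothetical stand-in that only records which methods were called."""

    def __init__(self):
        self.calls = []

    def start_profile(self):
        self.calls.append("start")

    def end_profile(self):
        self.calls.append("end")


def train(num_steps, profile_step, profiler):
    """Run a dummy loop and profile exactly one step (assumed pattern)."""
    for step in range(num_steps):
        if step == profile_step:
            profiler.start_profile()  # begin profiling on the chosen step only

        # The forward/backward/optimizer work for this step would go here.

        if step == profile_step:
            profiler.end_profile()  # end profiling within the same step
    return profiler.calls


calls = train(num_steps=5, profile_step=2, profiler=StubProfiler())
```

Gating both calls on the same step index keeps the profiled region from spanning several iterations, which is the situation the comment describes as producing unexpected results.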

@clumsy , sorry for the late response, I was out of office for a while. Can you please resolve the conflicts with the main branch? Thanks.

> Sure @cli99, are you in favor of the proposed changes though? Now I also need to find a way to make it reflect the expert parameters. Most changes look...

Hi @yerimChoi, the flops profiler does not support the DeepSpeed pipeline parallel engine yet. If you use pp_size > 1, disable the flops profiler.
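
One way to follow this advice is to switch the profiler off in the config whenever pipeline parallelism is in use. The sketch below assumes the config is held as a Python dict with a `flops_profiler` section containing an `enabled` flag; the helper name `disable_flops_profiler_for_pp` is hypothetical and not part of DeepSpeed.

```python
import copy


def disable_flops_profiler_for_pp(ds_config, pp_size):
    """Return a copy of the config with the flops profiler off when pp_size > 1."""
    cfg = copy.deepcopy(ds_config)  # leave the caller's dict untouched
    if pp_size > 1:
        # Create the section if it is missing, then force it off.
        cfg.setdefault("flops_profiler", {})["enabled"] = False
    return cfg


base_config = {"flops_profiler": {"enabled": True, "profile_step": 1}}
pp_config = disable_flops_profiler_for_pp(base_config, pp_size=2)
```

With `pp_size=1` the config is returned unchanged, so the same helper can be applied unconditionally before the config is passed on.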

Hi @gauravmunjal13, thanks for pointing out this issue. It should be fixed by https://github.com/microsoft/DeepSpeed/pull/3782.