Issues by phalani (1 result)
**Is your feature request related to a problem? Please describe.** At present, DeepSpeed’s inference communication backend defaults to the SHM-based operation `torch.ops.deepspeed.inference_all_reduce_` to perform all_reduce operations when the shared...
enhancement