Mayank Mishra

Results 187 comments of Mayank Mishra

@microsoft-github-policy-service agree

@jeffra @RezaYazdaniAminabadi @mrwyattii , I am unsure why the test is failing here. The is CUDA OOM in `nv-torch18-v100 / unit-tests` Doesn't seem related to the changes in this PR....

Thanks @tjruwase :) Want to get this in and start exploring the possibility of integrating torch.compile into deepspeed for accelerating training :)

Not sure why the CI is crashing @tjruwase . Doesn't seem to be a bug on my end. I will restart the CI build again once.

@jeffra @tjruwase the tests are still crashing, seems unrelated to this PR. Is the transformers version correct? ```shell E Traceback (most recent call last): E File "/tmp/actions-runner/_work/DeepSpeed/DeepSpeed/transformers/examples/pytorch/language-modeling/run_clm.py", line 635, in...

Can we prioritize this one? @tjruwase :)