Mayank Mishra
Mayank Mishra
@microsoft-github-policy-service agree
@jeffra @RezaYazdaniAminabadi @mrwyattii , I am unsure why the test is failing here. The is CUDA OOM in `nv-torch18-v100 / unit-tests` Doesn't seem related to the changes in this PR....
Thanks @tjruwase :) Want to get this in and start exploring the possibility of integrating torch.compile into deepspeed for accelerating training :)
Not sure why the CI is crashing @tjruwase . Doesn't seem to be a bug on my end. I will restart the CI build again once.
CI still seems to be broken :(
@jeffra @tjruwase the tests are still crashing, seems unrelated to this PR. Is the transformers version correct? ```shell E Traceback (most recent call last): E File "/tmp/actions-runner/_work/DeepSpeed/DeepSpeed/transformers/examples/pytorch/language-modeling/run_clm.py", line 635, in...
Successful checks @jeffra @tjruwase
its broken again :|
Thanks Stas :)
Can we prioritize this one? @tjruwase :)