Wei Wang
Wei Wang
@atalman and I would recommend this PR waiting for https://github.com/pytorch/pytorch/pull/121956, which is enabling CUDA 12.4 CI. Please wait for 12.4 CI to land first, thanks!
We discussed for a short term, we would have 11.8, 12.1, and 12.4. I will need to refactor this PR to add back 11.8.
12.4 workflows are failing. Still working on coming up with a fix.
@pytorchbot rebase
@malfet Could you please help take another look? I am composing torchinductor 12.4 issues in [here](https://github.com/pytorch/pytorch/issues/126692). Thanks!
Sorry for the mishaps. The PR went in 05/23 1:37pm, @malfet issued a "pytorch rebase" at 2:22pm on the #125963 PR, the result is based on #126976 (10:31am) I guess...
@clee2000 Good catch! I now realize there might be UCC/UCX related regression that newer UCC/UCX may not be working as well with cuda 11.8.
@pytorchbot revert -m "test failure seems related https://hud.pytorch.org/pytorch/pytorch/commit/5fb4a766b88bcf633a23610bd66de0f3020f7c66 https://github.com/pytorch/pytorch/actions/runs/9085206167/job/24972040039" -c ignoredsignal
@pytorchbot merge
@Skylion007 Sure, I will give it a try today.