torchdistx
torchdistx copied to clipboard
torchrun with deferred_init hang
i found that when use torchdistx with deferred_init it will hang as cuda memory copy