apex icon indicating copy to clipboard operation
apex copied to clipboard

AttributeError: module 'torch.distributed' has no attribute '_all_gather_base'

Open AungV opened this issue 3 years ago • 7 comments

https://github.com/NVIDIA/apex/blob/6a40a0ad9ff3d6ebea715cf28faf39792312acbf/apex/transformer/utils.py#L10

when i use FusedAdam in torch1.8, there is no all_gather_into_tensor or _all_gather_base in dir(torch.distributed).

AungV avatar Nov 10 '22 03:11 AungV

me too, do u solve it?

anber1 avatar Nov 10 '22 12:11 anber1

me too, do u solve it?

use old version 😀

AungV avatar Nov 10 '22 12:11 AungV

ok thanks

anber1 avatar Nov 10 '22 13:11 anber1

me too, do u solve it?

use old version 😀

What the old version is?How to use the old version?

toohappygirl avatar Mar 25 '23 13:03 toohappygirl

me too, do u solve it?

use old version 😀

What the old version is?How to use the old version?

find when this code be changed(added).

AungV avatar Apr 01 '23 10:04 AungV

I uninstalled the current version and installed "22.04-dev" of apex, then solved the same issue. (torch 1.9.1, cuda 11.1)

zhiyuuu avatar Apr 12 '23 06:04 zhiyuuu

I uninstalled the current version and installed "22.04-dev" of apex, then solved the same issue. (torch 1.9.1, cuda 11.1)

Thank you very much, the solution works perfectly for me. For reference, my PyTorch version is pytorch=1.8.1=py3.8_cuda11.1_cudnn8.0.5_0.

callanwu avatar Mar 14 '24 01:03 callanwu