ColossalAI icon indicating copy to clipboard operation
ColossalAI copied to clipboard

[BUG]: Torch compile causes multi-process to hang with python 3.9

Open Edenzzzz opened this issue 1 year ago • 0 comments

Is there an existing issue for this bug?

  • [X] I have searched the existing issues

🐛 Describe the bug

Posting this for documentation purposes. torch.compile has been reported to cause hang on exit when used with the "spawn" method in multiprocess. https://github.com/pytorch/pytorch/issues/115366 https://github.com/pytorch/pytorch/issues/126311 Upgrading to Python 3.10 may solve the problem (at least in my case). Currently, we can only push the Pytorch community to fix this🫠

Environment

PyTorch 2.1.1 + Python 3.9

Edenzzzz avatar Aug 10 '24 05:08 Edenzzzz