[BUG]: Shardformer failure with torch 2.3

Open Edenzzzz opened this issue 1 year ago • 0 comments

Is there an existing issue for this bug?

  • [x] I have searched the existing issues

🐛 Describe the bug

Installing PyTorch 2.3, which the newest version of xformers requires, can cause Shardformer to fail for obscure reasons. This issue is opened for tracking purposes and to discourage the use of torch 2.3 for now.

To reproduce:

python tests/test_optimizer/test_dist_lamb.py

[image]

Environment

PyTorch 2.3
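
Until the failure is understood, a small guard can flag an affected environment before tests are run. The sketch below is not part of ColossalAI: the helper name `is_affected_torch` is hypothetical, and it assumes that every 2.3.x build is affected, which this report does not confirm (only "PyTorch 2.3" is stated).

```python
import re
import warnings


def is_affected_torch(version: str) -> bool:
    """Return True if `version` looks like a torch 2.3.x release.

    Assumption: the whole 2.3 series is treated as affected; the report
    only names "PyTorch 2.3" without a patch version.
    """
    # Read the leading "major.minor" and ignore suffixes such as "+cu121".
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        return False
    return (int(match.group(1)), int(match.group(2))) == (2, 3)


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        print("torch is not installed; nothing to check")
    else:
        if is_affected_torch(torch.__version__):
            warnings.warn(
                f"torch {torch.__version__} detected; Shardformer may fail "
                "with torch 2.3 (see this issue)"
            )
```

The check parses the version string by hand, so it needs nothing beyond the standard library and works on local build tags like `2.3.0+cu121`.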

Edenzzzz · May 27 '24 08:05