ColossalAI
[BUG]: stuck at initialization and no error message
🐛 Describe the bug
When parallel is set to pipeline=4 and tensor=dict(mode='2d', size=4), the program hangs during initialization and prints no error message.
Environment
2*8 A100
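For reference, a minimal sketch of the reported setup, written as a config-file fragment plus a sanity check on the group sizes. It assumes "2*8 A100" means 16 GPUs in total and that pipeline is given as a plain integer; check_parallel_sizes is a hypothetical helper for illustration, not part of ColossalAI. It only confirms that the sizes are arithmetically consistent (so a size mismatch alone would not explain the hang).

```python
import math

# Parallel settings as described in the report.
parallel = dict(
    pipeline=4,
    tensor=dict(mode='2d', size=4),
)


def check_parallel_sizes(world_size, parallel):
    """Hypothetical helper: return the implied data-parallel size.

    Raises ValueError if the sizes cannot tile the world size, or if a
    2D tensor-parallel size is not a perfect square.
    """
    pp = parallel.get('pipeline', 1)
    tensor = parallel.get('tensor', {})
    tp = tensor.get('size', 1)

    # 2D tensor parallelism arranges devices in a q x q grid.
    if tensor.get('mode') == '2d' and math.isqrt(tp) ** 2 != tp:
        raise ValueError(f"2d tensor size {tp} is not a perfect square")

    if world_size % (pp * tp) != 0:
        raise ValueError(
            f"world size {world_size} is not divisible by "
            f"pipeline * tensor = {pp * tp}"
        )
    return world_size // (pp * tp)


# Assumption: 2 nodes x 8 GPUs = 16 processes.
world_size = 2 * 8
dp_size = check_parallel_sizes(world_size, parallel)
print(f"pipeline=4, tensor=4, data parallel={dp_size}")
```

With these numbers the 16 processes are fully consumed by pipeline and tensor parallelism, leaving a data-parallel size of 1.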
Same bug at tp2pp4
Same bug, looking forward to a solution... The difference is that I didn't use a parallel config.
which example is related to this bug?
gpt2 for the initial issue.
We have updated a lot. This issue was closed due to inactivity. Thanks.