ColossalAI-Examples
grad is None when running gpt2 with pipeline parallelism only
🐛 Describe the bug
Environment
torch==1.12.0a0+8a1a93a
num_gpu=4
pipeline=4
model=gpt2-small
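For anyone trying to narrow this down, a minimal sketch of how to list which parameters end up with `grad` set to None after a backward pass. It uses plain PyTorch only; the `params_without_grad` helper and the `Toy` model are hypothetical stand-ins for illustration and are not taken from the ColossalAI-Examples gpt2 script or from this report.

```python
from typing import List

import torch
import torch.nn as nn


def params_without_grad(model: nn.Module) -> List[str]:
    """Return names of trainable parameters whose .grad is still None."""
    return [
        name
        for name, param in model.named_parameters()
        if param.requires_grad and param.grad is None
    ]


class Toy(nn.Module):
    """Hypothetical stand-in model: one layer is used, one never is."""

    def __init__(self) -> None:
        super().__init__()
        self.used = nn.Linear(4, 4)
        self.unused = nn.Linear(4, 4)  # never called in forward()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.used(x)


model = Toy()
loss = model(torch.randn(2, 4)).sum()
loss.backward()

# Parameters that did not take part in the backward pass keep grad=None.
missing = params_without_grad(model)
print(missing)
```

Under pipeline parallelism each rank would only hold its own stage, so a check like this would presumably need to run on every rank right after the backward step to see which stage is affected.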