ColossalAI-Examples
grad is None when running gpt2 with pipeline parallelism only
🐛 Describe the bug

Environment
torch==1.12.0a0+8a1a93a
num_gpu=4
pipeline=4
model=gpt2-small