ColossalAI
ColossalAI copied to clipboard
[shardformer]support gradients accumulation fusion