ColossalAI
ColossalAI copied to clipboard
[shardformer] Support Qwen2 model
🚨 Issue number
- [ ] https://github.com/hpcaitech/ColossalAI/issues/5573
📝 What does this PR do?
[shardformer/modeling/qwen2]: add qwen2.py and qwen2 policy to support qwen2 model, have passed all the tests in test_shard_qwen2.py.
The environment is transformers==4.39.3, as 4.36.0 transformers didn't support qwen2 model.