ColossalAI
[feature] support Gemma2Model for tensor parallel training
📌 Checklist before creating the PR
- [x] I have created an issue for this PR for traceability
- [x] The title follows the standard format:
[doc/gemini/tensor/...]: A concise description
- [x] I have added relevant tags if possible for us to better distinguish different PRs
- [ ] I have installed pre-commit:
pip install pre-commit && pre-commit install
🚨 Issue number
fixed #6120
📝 What does this PR do?
Adds support for Gemma2Model in tensor parallel training.
Also includes a small bug fix needed to run the Llama model successfully.
💥 Checklist before requesting a review
- [x] I have linked my PR to an issue (instruction)
- [x] My issue clearly describes the problem/feature/proposal, with diagrams/charts/table/code if possible
- [x] I have performed a self-review of my code
- [ ] I have added thorough tests.
- [ ] I have added docstrings for all the functions/methods I implemented
⭐️ Do you enjoy contributing to Colossal-AI?
- [x] 🌝 Yes, I do.
- [ ] 🌚 No, I don't.
Thanks for contributing! To add a new model, we will also need unit tests. Please refer to the existing tests, and feel free to ping other team members.