[feature] support Gemma2Model for tensor parallel training

Open • jing-4369 opened this issue 1 year ago • 1 comment

📌 Checklist before creating the PR

  • [x] I have created an issue for this PR for traceability
  • [x] The title follows the standard format: [doc/gemini/tensor/...]: A concise description
  • [x] I have added relevant tags if possible for us to better distinguish different PRs
  • [ ] I have installed pre-commit: pip install pre-commit && pre-commit install

🚨 Issue number

fixed #6120

📝 What does this PR do?

Support Gemma2Model for tensor parallel training.

Also included is a small bug fix needed to run the llama model successfully.

💥 Checklist before requesting a review

  • [x] I have linked my PR to an issue (instruction)
  • [x] My issue clearly describes the problem/feature/proposal, with diagrams/charts/table/code if possible
  • [x] I have performed a self-review of my code
  • [ ] I have added thorough tests.
  • [ ] I have added docstrings for all the functions/methods I implemented

⭐️ Do you enjoy contributing to Colossal-AI?

  • [x] 🌝 Yes, I do.
  • [ ] 🌚 No, I don't.

jing-4369 • Nov 09 '24 13:11

Thanks for contributing! To add a new model, we will also need unit tests. Please reference the existing tests and feel free to ping other team members.

Edenzzzz • Nov 11 '24 18:11