ColossalAI
[FEATURE]: Add more training models and RLHF algorithms
Describe the feature
Add support for more training models and RLHF algorithms on the grpo-latest branch.
Hi! I’m interested in working on this issue. Could you please assign it to me? Thanks!