InternEvo
InternEvo copied to clipboard
[Feature] MoE模型里稠密层和专家层zero和并行的解耦
Describe the feature
MoE模型里稠密层和专家层zero和并行的解耦
Will you implement it?
- [ ] I would like to implement this feature and create a PR!