h2o-llmstudio
h2o-llmstudio copied to clipboard
[FEATURE] MoE Aux Loss
🚀 Feature
Implement MoE Aux Loss for Router
https://github.com/huggingface/transformers/blob/v4.36.1/src/transformers/models/mixtral/modeling_mixtral.py#L76