MoE-Infinity
TODO for first release
- [x] API design
- [x] Documentation for installation and PyPI
- [x] Performance table
- [x] Support Mixtral multi-GPU
- [ ] Load trace