sglang
sglang copied to clipboard
[ROCm] Optimal MOE Tuning for AMD Radeon Graphics
Motivation
Optimal MOE tuning for AMD Radeon Graphics Card
Modifications
Modify MOE configs for AMD Radeon Graphics
Checklist
- [ ] Format your code according to the Code Formatting with Pre-Commit.
- [ ] Add unit tests as outlined in the Running Unit Tests.
- [ ] Update documentation / docstrings / example tutorials as needed, according to Writing Documentation.
- [ ] Provide throughput / latency benchmark results and accuracy evaluation results as needed, according to Benchmark and Profiling and Accuracy Results.
- [ ] For reviewers: If you haven't made any contributions to this PR and are only assisting with merging the main branch, please remove yourself as a co-author when merging the PR.
- [ ] Please feel free to join our Slack channel at https://slack.sglang.ai to discuss your PR.
cc @HaiShaw verified for AMD Radeon Cards