lmdeploy
MoE bf16 EP
backends/moe.py and nn/moe.py have been refactored. The token dispatcher from DLBlas is reused.