MoE-Infinity icon indicating copy to clipboard operation
MoE-Infinity copied to clipboard

Does it support other DeepSeek models?

Open wuooo339 opened this issue 9 months ago • 4 comments

I want to inference other DeepSeek models in V100 GPU.Does it support?Such as deepseek-ai's DeepSeek-R1-Distill-Llama-70B or DeepSeek-R1-Distill-Qwen-32B?

wuooo339 avatar Mar 18 '25 08:03 wuooo339

These two models are llama and qwen by themselves but distilled using R1, we plan to support Qwen later

drunkcoding avatar Mar 18 '25 14:03 drunkcoding

@drunkcoding Dose it support python3.12?

wuooo339 avatar Mar 19 '25 03:03 wuooo339

We haven't tested yet, but given Python3 is is backward compatible, it should work. You might need to buiild wheel by yourself form source using BUILD_OPS=1 python3 -m build

drunkcoding avatar Mar 19 '25 13:03 drunkcoding

Dose it support deepseek-moe-16b-chat? Thanks

TheLogan6 avatar Mar 25 '25 07:03 TheLogan6