ServerlessLLM icon indicating copy to clipboard operation
ServerlessLLM copied to clipboard

Other hardware adaptation

Open mstJuly opened this issue 2 months ago • 2 comments

If I wish to adapt sllm for the Ascend NPU, what modifications should I undertake?

mstJuly avatar Nov 07 '25 09:11 mstJuly

Hi! Thanks for your interest in adapting SLLM for the Ascend NPU.

There’s already a version of SLLM that supports Ascend NPU available here: https://gitee.com/openeuler/ServerlessLLM . You can refer to that project for guidance and implementation details. If you encounter any issues or bugs, feel free to discuss them here — we’d be happy to help!

future-xy avatar Nov 10 '25 14:11 future-xy

Hi! Thanks for your interest in adapting SLLM for the Ascend NPU.

There’s already a version of SLLM that supports Ascend NPU available here: https://gitee.com/openeuler/ServerlessLLM . You can refer to that project for guidance and implementation details. If you encounter any issues or bugs, feel free to discuss them here — we’d be happy to help!

Attempted to run the project on a CUDA-compatible metax GPU. The demo ran successfully, but when testing DeepSeekOCR, sllm-store segfaulted in model.cpp while copying the model to the GPU. Not sure if this is due to DeepSeekOCR not being supported yet.

e1ijah1 avatar Nov 13 '25 03:11 e1ijah1