[Feature] <如何基于deepspeed实现单机多卡或者多机多卡的推理？>

Open clintlong opened this issue 2 years ago • 2 comments

No response

有人实现了基于deepspeed框架进行多机多卡推理吗？

No response

Apr 19 '23 12:04 clintlong

一个gpu起一个Server，最后用nginx负载。

Apr 19 '23 17:04 wfuqiang1982

https://github.com/microsoft/Megatron-DeepSpeed 这个项目可以帮助deepspeed实现多卡多机推理，但需要改动模型

Apr 28 '23 16:04 sharlec