Fast Distributed Inference Serving for Large Language Models

Open: pentium3 opened this issue 11 months ago • 1 comment

https://arxiv.org/pdf/2305.05920.pdf

pentium3, Mar 08 '24 01:03

https://zhuanlan.zhihu.com/p/648759542

pentium3, Mar 08 '24 01:03