sys_reading icon indicating copy to clipboard operation
sys_reading copied to clipboard

DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving

Open pentium3 opened this issue 11 months ago • 0 comments

https://arxiv.org/pdf/2401.09670v1.pdf

pentium3 avatar Mar 21 '24 06:03 pentium3