sys_reading icon indicating copy to clipboard operation
sys_reading copied to clipboard

Splitwise: Efficient Generative LLM Inference Using Phase Splitting

Open pentium3 opened this issue 1 year ago • 0 comments

https://arxiv.org/pdf/2311.18677.pdf

pentium3 avatar Feb 27 '24 07:02 pentium3