mlc-llm icon indicating copy to clipboard operation
mlc-llm copied to clipboard

KV cache offloading to CPU RAM

Open shahizat opened this issue 11 months ago • 1 comments

Hello MLC-LLM team,

I would appreciate it if you could implement KV cache offloading in the near future. Thanks in advance!

shahizat avatar Nov 17 '24 21:11 shahizat