nano-vllm
Can I use the CPU for inference?
I want to run inference on the CPU. Can that work? And is it possible to skip installing flash-attn?
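For context, the kind of fallback this question implies could be sketched like this: since flash-attn is CUDA-only, a CPU path would need to replace it with PyTorch's built-in `scaled_dot_product_attention`. This is a hypothetical sketch, not nano-vllm's actual code, and the `attention` helper below is an assumption for illustration:

```python
import torch
import torch.nn.functional as F

def attention(q, k, v):
    """Use flash-attn when CUDA is available, else fall back to
    PyTorch's CPU-capable scaled_dot_product_attention.

    Note: flash-attn takes (batch, seq_len, n_heads, head_dim),
    while SDPA takes (batch, n_heads, seq_len, head_dim).
    """
    if torch.cuda.is_available():
        try:
            from flash_attn import flash_attn_func  # CUDA-only package
            return flash_attn_func(q.transpose(1, 2),
                                   k.transpose(1, 2),
                                   v.transpose(1, 2)).transpose(1, 2)
        except ImportError:
            pass  # flash-attn not installed; use the fallback below
    # Runs on CPU; layout is (batch, n_heads, seq_len, head_dim)
    return F.scaled_dot_product_attention(q, k, v)

# CPU tensors, shape (batch, n_heads, seq_len, head_dim)
q = k = v = torch.randn(1, 2, 4, 8)
out = attention(q, k, v)
print(out.shape)  # torch.Size([1, 2, 4, 8])
```

Whether nano-vllm itself can take this path depends on whether its attention calls and CUDA-specific kernels can be swapped out this way.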