nano-vllm

Can I use the CPU for inference?

Open • beausoft opened this issue 2 weeks ago • 1 comment

I want to run inference on the CPU. Is that supported? And is it possible to skip installing flash-attn?

beausoft, Nov 21 '25 06:11