nano-vllm icon indicating copy to clipboard operation
nano-vllm copied to clipboard

Nano vLLM

Results 43 nano-vllm issues
Sort by recently updated
recently updated
newest added

I want to use the CPU for inference. Can it work? Is it possible to not install flash-attn?

* [ADD] Add TTFT, TPOT metrics. * [FIX] remove unnecessary dependencies. * [FIX] replace 'is False' to 'not'. * [FIX] modified some prompts. * [FIX] trailing blanks. * [Fix] redundant...