nano-vllm
nano-vllm copied to clipboard
Nano vLLM
Results
43
nano-vllm issues
Sort by
recently updated
recently updated
newest added
I want to use the CPU for inference. Can it work? Is it possible to not install flash-attn?
* [ADD] Add TTFT, TPOT metrics. * [FIX] remove unnecessary dependencies. * [FIX] replace 'is False' to 'not'. * [FIX] modified some prompts. * [FIX] trailing blanks. * [Fix] redundant...