nano-vllm
[ADD] Add TTFT, TPOT metrics in tqdm bars.
- [ADD] Add TTFT, TPOT metrics.
- [FIX] remove unnecessary dependencies.
- [FIX] replace 'is False' with 'not'.
- [FIX] modified some prompts.
- [FIX] remove trailing blanks.
- [FIX] remove redundant variables.
TTFT is the "Time To First Token" metric, and TPOT is the "Time Per Output Token" metric. Both appear in the tqdm progress bars as follows.
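For readers unfamiliar with the two metrics, here is a minimal sketch of how they could be computed from per-token timestamps. It is not the code in this change: `RequestTimer` and `format_postfix` are hypothetical names, and the TPOT formula (decode time divided by the number of tokens after the first) is one common convention, assumed here.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class RequestTimer:
    """Hypothetical helper: records when a request started and when each token arrived."""
    start: float
    token_times: List[float] = field(default_factory=list)

    def on_token(self, now: float) -> None:
        # Call once per generated token with the current timestamp (seconds).
        self.token_times.append(now)

    def ttft(self) -> Optional[float]:
        # Time To First Token: delay between request start and the first token.
        if not self.token_times:
            return None
        return self.token_times[0] - self.start

    def tpot(self) -> Optional[float]:
        # Time Per Output Token (assumed convention): average gap between
        # tokens after the first one, so at least two tokens are required.
        if len(self.token_times) < 2:
            return None
        decode_time = self.token_times[-1] - self.token_times[0]
        return decode_time / (len(self.token_times) - 1)


def format_postfix(timer: RequestTimer) -> Dict[str, str]:
    # Build a small dict of human-readable values in milliseconds.
    def ms(value: Optional[float]) -> str:
        return "n/a" if value is None else f"{value * 1000:.1f}ms"

    return {"TTFT": ms(timer.ttft()), "TPOT": ms(timer.tpot())}
```

The dict returned by `format_postfix` can be passed to a tqdm bar with `pbar.set_postfix(format_postfix(timer))`, which appends the values to the right of the progress bar. In real use the timestamps would come from a monotonic clock such as `time.perf_counter()`.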