tensorrtllm_backend

LLM performance metrics

Open weibingo opened this issue 10 months ago • 0 comments

Does TensorRT-LLM expose LLM performance metrics, e.g. TTFT (time to first token), per-token latency, input token count, and output token count? If so, please tell me where to find them. If not, could metrics like the ones vLLM provides be added? These metrics are very important.
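To make the request concrete, here is a minimal sketch of how these metrics could be derived from per-token timestamps on a streamed response. It is only an illustration of the definitions, not TensorRT-LLM or vLLM code: `RequestTiming`, `summarize`, and all field names are hypothetical, and it assumes the client (or server) records a wall-clock time for each output token.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class RequestTiming:
    """Timestamps in seconds for one streamed request (hypothetical structure)."""
    request_start: float        # when the request was submitted
    token_times: List[float]    # arrival time of each output token, in order
    num_input_tokens: int       # prompt length in tokens


def summarize(t: RequestTiming) -> Dict[str, float]:
    """Compute TTFT, mean inter-token latency, end-to-end latency, and token counts."""
    num_output = len(t.token_times)
    if num_output == 0:
        raise ValueError("no output tokens recorded")

    # Time to first token: delay between submission and the first output token.
    ttft = t.token_times[0] - t.request_start
    # End-to-end latency: delay between submission and the last output token.
    e2e = t.token_times[-1] - t.request_start
    # Mean inter-token latency over the tokens after the first one.
    if num_output > 1:
        itl = (t.token_times[-1] - t.token_times[0]) / (num_output - 1)
    else:
        itl = 0.0

    return {
        "ttft_s": ttft,
        "mean_inter_token_latency_s": itl,
        "e2e_latency_s": e2e,
        "input_tokens": float(t.num_input_tokens),
        "output_tokens": float(num_output),
    }


if __name__ == "__main__":
    timing = RequestTiming(request_start=0.0, token_times=[0.5, 0.75, 1.0], num_input_tokens=8)
    print(summarize(timing))
```

In a serving setup these per-request values would typically be aggregated (for example as histograms) rather than reported one request at a time.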

weibingo, Dec 11 '24 06:12