openvino.genai fix the caculation of performance metric

fix the caculation of performance metric

Open QuPengfei opened this issue 4 months ago • 0 comments

throughput/latency calculation issue when bs > 1. increase in unexpected way.

tm_list from the following should be the per token, not per batch. tm_list = np.array(perf_metrics.raw_metrics.m_durations) / 1000 / 1000

thanks, Pengfei

Oct 10 '24 07:10 QuPengfei