feat: Add BW measurement
Request BW: This metric is measured on the generation side for each request. It is calculated as: Request BW = (Total KVCacheSize of Request) / (Total Time for all Generations to Complete)
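For reviewers, here is a minimal sketch of the formula above. It is not the code in this PR: the function names, the units (bytes and seconds), and the reading of "total time" as the span from the earliest generation start to the latest generation end are all assumptions made for illustration.

```python
from typing import Iterable, Tuple


def total_generation_time(intervals: Iterable[Tuple[float, float]]) -> float:
    """Hypothetical helper: wall-clock span covering all generations.

    Each interval is an assumed (start_s, end_s) pair for one generation.
    The span runs from the earliest start to the latest end.
    """
    intervals = list(intervals)
    if not intervals:
        raise ValueError("at least one generation interval is required")
    earliest_start = min(start for start, _ in intervals)
    latest_end = max(end for _, end in intervals)
    return latest_end - earliest_start


def request_bandwidth(kv_cache_size_bytes: int, total_time_s: float) -> float:
    """Request BW = total KV cache size of the request / total time.

    Returns bytes per second under the assumed units.
    """
    if total_time_s <= 0:
        raise ValueError("total time must be positive")
    return kv_cache_size_bytes / total_time_s


# Example with made-up numbers: 1 MB of KV cache, two overlapping generations.
span_s = total_generation_time([(0.0, 0.3), (0.1, 0.5)])
bw = request_bandwidth(1_000_000, span_s)
```

If the actual measurement sums per-generation durations instead of taking the overall span, only `total_generation_time` would need to change.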
@Shunkangz @Shixiaowei02 can you please review?
/bot run
PR_Github #505 [ run ] triggered by Bot
PR_Github #505 [ run ] completed with state FAILURE
/LLM/main/L0_MergeRequest_PR pipeline #434 completed with status: 'FAILURE'
/bot run
PR_Github #595 [ run ] triggered by Bot
PR_Github #595 [ run ] completed with state FAILURE
/LLM/main/L0_MergeRequest_PR pipeline #504 completed with status: 'FAILURE'
/bot run
PR_Github #655 [ run ] triggered by Bot
PR_Github #655 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #552 completed with status: 'SUCCESS'
Thank you for your contribution! @BatshevaBlack
/bot reuse-pipeline
PR_Github #665 [ reuse-pipeline ] triggered by Bot
PR_Github #665 [ reuse-pipeline ] completed with state SUCCESS
Reusing PR_Github #655 for commit 64e092b