clusterdata

Why does the evaluator for an inference job consume so much time in cluster-trace-gpu-v2020?

Open cashey opened this issue 2 years ago • 0 comments

1. As shown in the picture, the "evaluator" task is for inference jobs, and its "runtime" is huge: 1695265375378.
2. In the paper (MLaaS in the Wild: Workload Analysis and Scheduling in Large-Scale Heterogeneous GPU Clusters), Figure 4a, the task run time also begins at 10s. Inference jobs such as image classification do not need 10s. So are there no such jobs in the cluster? And what kind of job consumes so much time?
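To show how large the quoted value is, here is a small sanity check. It is only a sketch: the issue does not state the unit of "runtime", so treating the number as milliseconds is an assumption, and the helper names `as_duration_years` and `as_utc_timestamp` are hypothetical. It reads the value in two ways: as a duration, and as a Unix epoch timestamp.

```python
from datetime import datetime, timezone

# Value quoted in the issue. The unit (milliseconds) is an assumption.
RUNTIME = 1695265375378


def as_duration_years(ms: int) -> float:
    """Interpret the value as a duration in milliseconds and convert to years."""
    seconds = ms / 1000
    return seconds / 86400 / 365.25


def as_utc_timestamp(ms: int) -> datetime:
    """Interpret the value as milliseconds since the Unix epoch (UTC)."""
    return datetime.fromtimestamp(ms // 1000, tz=timezone.utc)


if __name__ == "__main__":
    print(f"as a duration:  {as_duration_years(RUNTIME):.1f} years")
    print(f"as a timestamp: {as_utc_timestamp(RUNTIME).isoformat()}")
```

Under the millisecond assumption, the duration reading is far too long for any task, while the timestamp reading falls on a recent calendar date. This might mean the value is not a plain duration, but that is only a guess.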

Thank you very much!

cashey, Sep 21 '23 03:09