WMCore

Incorrect TotalEstimatedJobs Calculation

Open hassan11196 opened this issue 6 months ago • 3 comments

Impact of the bug

ReqMgr2

Describe the bug

The TotalEstimatedJobs calculation is not accurate and does not always correlate with the actual number of jobs created by the agents.

For example:

For this ReReco ACDC1 workflow, TotalEstimatedJobs is 1,084,175 (~1 million jobs), while the actual number of jobs created is ~14K, as can be seen in wmstats.

The number of jobs actually created also correlates with the number of failed jobs in ACDC0 of this request, i.e. ~13K jobs. Note, however, that the splitting was modified to 2x when the ACDC was created.

TLDR

ACDC1: TotalEstimatedJobs ~= 1 million jobs, while actual jobs created ~14K.
ACDC0: 13K failed jobs and TotalEstimatedJobs of 380K, while actual jobs created ~18K.
Original workflow: 15K failed jobs and TotalEstimatedJobs of 53K, while actual jobs created ~37K.

In all three cases, TotalEstimatedJobs does not match the actual number of jobs created.
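To put the size of the discrepancy in numbers, here is a minimal sketch that divides the estimate by the jobs actually created, using only the approximate figures quoted above. The dictionary layout and the helper name `overestimation_factor` are made up for illustration; they are not ReqMgr2 or WMCore code.

```python
# Approximate figures quoted in this issue (not pulled from ReqMgr2 or wmstats).
workflows = {
    "Original": {"estimated": 53_000, "created": 37_000},
    "ACDC0": {"estimated": 380_000, "created": 18_000},
    "ACDC1": {"estimated": 1_084_175, "created": 14_000},
}


def overestimation_factor(estimated, created):
    """Hypothetical helper: how many times larger the estimate is than reality."""
    return estimated / created


factors = {
    name: overestimation_factor(values["estimated"], values["created"])
    for name, values in workflows.items()
}

for name, factor in factors.items():
    print(f"{name}: TotalEstimatedJobs is ~{factor:.1f}x the jobs created")
```

With these figures the estimate is about 1.4x too high for the original workflow, about 21x for ACDC0, and about 77x for ACDC1, so the gap widens with each ACDC round.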

How to reproduce it

Not sure

Expected behavior

TotalEstimatedJobs should match the actual number of jobs created.

Additional context and error message

I found this discrepancy while implementing protection in Unified for ACDCs with a large number of failed jobs.

FYI @amaltaro @haozturk @anpicci

hassan11196, Aug 06 '24 07:08