WMCore
WMCore copied to clipboard
Incorrect TotalEstimatedJobs Calculation
Impact of the bug ReqMgr2
Describe the bug
The TotalEstimatedJobs
calculation is not accurate and does not always correlate with the actual number of jobs created by the agents.
For Example:
For this ReReco ACDC1 Workflow, The TotalEstimatedJobs
is 1,084,175 ~ 1 Million Jobs, while the actual number of jobs created is ~14K jobs, which we can see from wmstats.
This number also correlates with the number of failed jobs for ACDC0 of this request i.e ~13K jobs, However, while creating ACDC, splitting was modified to 2x.
TLDR
ACDC1 TotalEstimatedJobs
~= 1 Million Jobs , while actual jobs created ~14K
ACDC0 had 13K failed jobs and a total TotalEstimatedJobs
380K jobs, while actual jobs created ~18K.
Original Workflow had 15K failed jobs 53K TotalEstimatedJobs
Jobs, while actual jobs created ~37K.
TotalEstimatedJobs
does not match actual number of jobs created.
How to reproduce it Not sure
Expected behavior
TotalEstimatedJobs
should match the actual number of jobs created.
Additional context and error message I found this discrepancy while implementing protection in Unified for ACDCs with a large amount of failed Jobs.
FYI @amaltaro @haozturk @anpicci