vlimant
is there a way (Grafana or whatnot) to monitor where things are queuing for specific workflows/tasks?
is there any way we can relax the merge job restriction at sites that then end up being a merge bottleneck, to get workflows out and cleared from the system...
assign generators
@Kiarendil While not a showstopper, we need to figure out why we have to throw away 1M CPUh on running this workflow and generator configuration. Throwing failures at computing operation...
the number of job failures in https://cms-unified.web.cern.ch/cms-unified/report/cmsunified_task_BPH-Run3Summer22EEGS-00172__v1_T_250416_160139_6446 is horrendous
somewhat related; at least it’s the same error message coming from https://cms-unified.web.cern.ch/cms-unified/report/cmsunified_task_HIG-Run3Summer22EEGS-00037__v1_T_250517_210739_6514 ``` Fatal Exception (Exit code: 8001) An exception of category 'ExternalFailed' occurred while [0] Processing Event run: 1...
regarding "unreasonably high value", I think you are mistaken. The value is set very reasonably so that the output dataset has more events than lumisections (preventing the lumi branches and...
Any update on this end? We find ourselves again in the same situation that triggered this request in the first place (producing two MINIAODSIM outputs from the same workflow). I...
thanks for the positive feedback; we'll try it out with the output renaming
can you please help out with https://its.cern.ch/jira/browse/CMSPROD-264 to figure out an a priori check on TaskChain that would prevent a StepChain conversion? I believe one can look into the splitting document...