jeyserma
jeyserma
A few weeks ago I did some timing/efficiency studies for repacking as a function of the #threads, comparing the old and new compression algorithms. See the plots below for reference,...
We have another paused job for the muon PD with exceeding memory. The tarball can be found here: ``` /eos/home-c/cmst0/public/PausedJobs/Run2024G/maxPSS/PromptReco_Run386319_Muon1 ``` I copied the RAW input file to the following...
Oh sorry, I mixed up the job tarballs of a different paused job. Please ignore my previous comment about the files. The one you analyzed (run 386037) was a cosmic...
I found the correct tarball + RAW file and copied them over to the same location (removed the old files): ``` /eos/home-c/cmst0/public/PausedJobs/Run2024G/maxPSS/PromptReco_Run386319_Muon1 ``` The maxPSS error is visible in the...
Two more occurred and are reported here: ``` /eos/home-c/cmst0/public/PausedJobs/Run2024G/maxPSS/PromptReco_Run386694_Muon0/ ``` The RAW files are also copied over.
maxPSS paused jobs are automatically retried 3 times by our agent. For this particular memory issue, we increased the memory limit to 17 GB (default 16 GB for 8 cores),...
FYI: We resumed the 300 paused jobs with 5 streams (keeping 8 threads), and the majority finished successfully. Only 30 jobs (10%) failed again due to similar maxPSS issues. We...