chaen
chaen
When the MQ server are restarted (and it may happen), the client then is blocked. It should reconnect. @wkrzemien can you please look at this urgently ? Thanks !
If a ForwardDISET operation fails (and always will), the REA will loop on it infinitely since there are no attempt counters (linked to the File table). We should find a...
I just keep it here for my memory, but I am not sure how we end up retrying URLs (as the logs seem to indicate) as the exception is caught...
This aims at solving https://github.com/DIRACGrid/DIRAC/issues/7116 It is quite a big change of logic, but should not impact anybody but LHCb, as it mostly matters for complex body plugin Also, it...
Looking into the performance of the `TransformationSystem`, and its DB in particular, the hotest spot is the `DataFiles` table. The aim of this table is to deduplicate the LFN in...
`getReplicasForJobs` (in the API/DataManager) returns only replicas that are allowed for job, i.e. on disk, not on failover, etc. However, the `RunningLimit` of a given site can be set to...
### Description `py-spy` has the nice option of dumping the content of the local variables and arguments when dumping a live program. This shows extremely useful to debug. Do you...
Hi, Lately, we have put the IN2P3 storage (dCache 9.2.6) under a lot of stress, leading to some instabilities. In practice, some `xroot` upload attempts from jobs running on worker...
Obviously very bad, as it seems to have resulted in multiple FTS transfers stepping on each other's toes ```python 2024-09-18 12:30:48 UTC DataManagement/FTS3Agent/FTS3DB ERROR: persistOperation: unexpected exception Traceback (most recent...
When a user is deleted from DIRAC, it should delete the entry in the DiracX sub (changche in LHCb)