
[Finalize] Del_proc also spawned processes that have different jobids

Open · EmmanuelBRELLE opened this issue 6 months ago • 1 comment

PML cleanup may be incomplete for processes with a different name. This happens, for example, with spawned processes, which have a different jobid.

EmmanuelBRELLE, Oct 15 '25 14:10

I understand the issue you raised but I don't think this is the correct solution. You are now removing all peers from the PML, even those that belong to other sessions.

A quick scan through the code suggests that the same logic is used when sessions are created (ompi_proc_get_allocated is called to get the list of processes to be added), so this simple fix does not look like the right solution to the problem.

That's a good point, thanks. To fix this issue, the new proposal is to keep track of the list of spawned jobids related to the instance (= session). Only processes with those jobids will be removed from the PML at instance finalize, so peers that belong to other sessions are left untouched.
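To make the proposal concrete, here is a minimal standalone sketch of the bookkeeping, not the actual patch. All names in it (`proc_name_t`, `instance_jobids_t`, `instance_jobids_add`, `instance_select_procs`) are hypothetical and do not exist in Open MPI; it only illustrates recording jobids per instance and then selecting the matching processes at finalize.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical stand-in for a process name: (jobid, vpid). */
typedef struct {
    uint32_t jobid;
    uint32_t vpid;
} proc_name_t;

/* Hypothetical per-instance record of the jobids the instance knows about
 * (its own job plus any jobs it spawned). */
typedef struct {
    uint32_t *jobids;
    size_t count;
    size_t capacity;
} instance_jobids_t;

static void instance_jobids_init(instance_jobids_t *t)
{
    assert(t != NULL);
    t->jobids = NULL;
    t->count = 0;
    t->capacity = 0;
}

static bool instance_jobids_contains(const instance_jobids_t *t, uint32_t jobid)
{
    for (size_t i = 0; i < t->count; i++) {
        if (t->jobids[i] == jobid) {
            return true;
        }
    }
    return false;
}

/* Record a jobid once. Returns 0 on success, -1 if allocation fails. */
static int instance_jobids_add(instance_jobids_t *t, uint32_t jobid)
{
    if (instance_jobids_contains(t, jobid)) {
        return 0; /* already tracked, nothing to do */
    }
    if (t->count == t->capacity) {
        size_t new_cap = (t->capacity == 0) ? 4 : t->capacity * 2;
        uint32_t *tmp = realloc(t->jobids, new_cap * sizeof(*tmp));
        if (tmp == NULL) {
            return -1;
        }
        t->jobids = tmp;
        t->capacity = new_cap;
    }
    t->jobids[t->count++] = jobid;
    return 0;
}

static void instance_jobids_free(instance_jobids_t *t)
{
    free(t->jobids);
    instance_jobids_init(t);
}

/* At finalize: collect only the procs whose jobid is tracked by this
 * instance. `out` must have room for nprocs pointers. Returns the number
 * of procs selected; the caller would hand these to the PML cleanup. */
static size_t instance_select_procs(const instance_jobids_t *t,
                                    const proc_name_t *procs, size_t nprocs,
                                    const proc_name_t **out)
{
    size_t n = 0;
    for (size_t i = 0; i < nprocs; i++) {
        if (instance_jobids_contains(t, procs[i].jobid)) {
            out[n++] = &procs[i];
        }
    }
    return n;
}
```

In the real code the selected list would presumably be what gets passed to the PML del_procs path at instance finalize, instead of everything returned by ompi_proc_get_allocated. A linear scan is used here because the number of spawned jobs per instance is expected to be small; that is an assumption, not something measured.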

EmmanuelBRELLE, Dec 08 '25 16:12