Hui Zhou
Hui Zhou
test:mpich/pmi test:mpich/ch3/most test:mpich/ch4/most
test:mpich/pmi
Which MPICH release are you referring? If you are using mpich-3.2, you may try `--with-device=ch3:nemesis:ofi`. `ch4:ucx` should be available in mpich-3.3 and newer.
@cs0731 ch3 is in maintenance mode. Is it possible for you try ch4 using later MPICH releases?
The comment doesn't make sense and probably should be deleted. The code it commented on is just adding up local sizes. It is likely the "allreduce" comment was meant on...
We use `tm_spawn` to launch `hydra_pmi_proxy`, which seems to always launch into a new process session. I guess we need check whether the proxy is on the same node as...
Yeah, we can fix it. Thanks for reporting.
Hmm, I couldn't reproduce the issue: ``` $ mpiexec -l -n 2 ./mpitest [0] About to call MPI_Init [1] About to throw 0 [1] terminate called after throwing an instance...
I confirm that if one process `return` we will hang. Let me see if I can fix that.
The code in question is here - https://github.com/pmodels/mpich/blob/e09f4cd9df7a1f238f9eee17a5f0fb80bc16ea52/src/pm/hydra/proxy/pmip_cb.c#L308 If we remove the `HYD_pmcd_pmip.downstream.pmi_fd_active[pid]` condition, then the exit without `PMI_Init` will trigger the same cleanup behavior. We need discuss within the...