Hui Zhou

Results 695 comments of Hui Zhou

> Looking at the `manyrma2` failure. I think it is just a time-out due to the active message path being too slow. It is not related to your PR. Could you...

> I have updated the commit messages and rebased on the latest `main`. The results in the PR are also updated. Thanks! The difference between enabling and disabling `TOPO_ENABLE` on...

test:mpich/custom
netmod: ch4:ucx
config: hcoll

Hangs during init:
```
Thread 1 "cpi" received signal SIGINT, Interrupt.
0x00007ffff5d13e93 in progress () at src/mpid/common/hcoll/hcoll_rte.c:61
61 }
#0 0x00007ffff5d13e93 in progress () at...
```

test:mpich/custom
netmod: ch4:ucx
config: hcoll

test:mpich/ch3/most test:mpich/ch4/most

test:mpich/custom config: mpi-abi ✔️

> @hzhou mpi4py still broken, same error.

Added a fix -- I am getting sloppy 🫠. Could you try again?

test:mpich/ch4/xpmem
test:mpich/ch4/gpu/ofi
test:mpich/ch4/ucx

xpmem failures:

![image](https://github.com/pmodels/mpich/assets/1496702/ae7727af-1eee-4d27-af7f-8f7bb04fbaf8)

The timeouts are a sockets provider performance issue, unrelated to xpmem. The threadcomm failure will be addressed here: https://github.com/pmodels/mpich/pull/6579/commits/4455a5a7c21fca965f67a1bd3494fd093b49c360

@raffenet The xpmem test failures are unrelated and will be addressed in #6579. Could you review this one first?

We need to add the mpi-abi config to our CI review workflow.