Hui Zhou

Results 695 comments of Hui Zhou

I got the error: ``` Abort(894120335) on node 23 (rank 23 in comm 496): Fatal error in internal_Wait: Other MPI error, error stack: internal_Wait(72040)..........: MPI_Wait(request=0x8aed160, status=0x1) failed MPIR_Wait(741)................: MPIR_Wait_state(698)..........: MPIDI_progress_test(171)......:...

As usual, once I added debug `printf`s, the bug hides away :(

Now I couldn't get the error even without the `printf` :(

Okay, triggers the error 30% of the time. I think I got some clues now.

This is currently how pipeline works: Sender: 1. while (chunks_remain) issue_chunk_async_copy 2. for-any chunk async_copy done -> issue chunk send Receirver: 1. while (chunks_remain) issue chunk receive 2. for-any received...

Thanks @mathomp4 ! I'll take a look and incorporate your patch.

@mathomp4 I am not sure where `-pthread` is from. It is an option for some compiler. It doesn't appear on my system. Could you try - ``` find . -name...

Libevent is pulled in by hwloc Get Outlook for Android ________________________________ From: Matt Thompson ***@***.***> Sent: Friday, August 29, 2025 8:41:36 AM To: pmodels/mpich ***@***.***> Cc: Hui Zhou ***@***.***>; Assign...

> Libevent is pulled in by hwloc Actually, I probably was mistaken. I know `pmix` uses `libevent`. Libfabric may as well. @mathomp4 Could you do a `make clean` then `make...

> I understand that multiple threads cannot access the same stream index in parallel, That is the only restriction to use MPIX stream. However, you need to understand how MPI...