Hui Zhou
test:mpich/ch4/most ✔️ test:mpich/ch3/most ✔️ test:mpich/ch4/gpu/ofi ❌ - (typo in testlist)
test:mpich/ch4/gpu/ofi EDIT: TIMED OUT. My local machine has `cuda-11.2`, where the test passed; Jenkins has `cuda-11.1`. I wonder whether that makes a difference.
test:mpich/ch4/gpu/ofi
test:mpich/ch4/most test:mpich/ch4/gpu/ofi
test:mpich/ch4/most
test:mpich/ch4/gpu/ofi
The wait kernel has too many issues. This PR passes my local testing but still times out on Jenkins. We should consider using stream memory operations (https://docs.nvidia.com/cuda/cuda-driver-api/group__CUDA__MEMOP.html). Nevertheless, this PR...
test:mpich/ch4/most test:mpich/ch4/gpu/ofi
A narrowed-down reproducer:
```
[0] ./allred2 -evenmemtype=device -oddmemtype=device
[0] TEST MPI_COMM_WORLD
[0] count = 2000
[1] TEST MPI_COMM_WORLD
[1] count = 2000
[2] TEST MPI_COMM_WORLD
[2] count = 2000
[3]...
```
Just tried the latest ucx master (commit 68fa8ee661deafc826716a72be88d629e5f41f38) and the test passed. Looking through the log, it is not clear which patch fixed it, but there were a few patches...