mpich
mpich copied to clipboard
feature: MPL_gpu_imemcpy is not implemented in CUDA
In PR https://github.com/pmodels/mpich/pull/6451, ipc read bcast and alltoall requires MPL_gpu_imemcpy to move the data. But this function is only implemented in mpl_gpu_ze.c (no implementation of this function in CUDA or HIP), causing the tests for ipc read bcast and alltoall (alltoall_gpu.c and bcast_gpu.c in test/mpi/coll) to fail.
In PR https://github.com/pmodels/mpich/pull/6733, ipc read allgather and allgatherv also requires MPL_gpu_imemcpy
Will someone from Intel implement the functionality?
I am not sure. Let me check with Maria.