ompi
ompi copied to clipboard
Open MPI main development repository
Fixes the following: ** CID 1675024: Memory - illegal accesses (BUFFER_SIZE) ** CID 1675023: Null pointer dereferences (REVERSE_INULL) ** CID 1675022: Error handling issues (CHECKED_RETURN) ** CID 1675021: Memory -...
Hi [Update: the problem also appears for open-mpi-5.0.9] I am using cuda-aware open-mpi-5.0.9 I want to write applications that use OpenMPI to send data to different nodes with several gpu...
`coll/han` provides better latency than `coll/tuned` in cases where processes are mapped to nodes nonconsecutively, e.g., using `--rank-by node`. In that case `coll/han` reduces the amount of cross-node traffic. Its...
As an OpenMPI user, I noticed unexpected behavior when running MPI programs with Slurm’s `srun`. - **Environment:** `module load OpenMPI/5.0.3` - **What happens:** **Using `mpirun`** (works as expected) ``` Hello...
HPE's CrayMPI has a feature where during the call to MPI_Finalize, it can display or output to NIC-specific files a collection of network hardware performance counters and output them at...
Thanks for asking me to work on this. I will get started on it and keep this PR's description up to date as I form a plan and make progress....
Without a way to revoke the subcomms created by the coll modules, processes can get stuck waiting on sub-operations of a collective on a revoked communicator. I chose to put...
Tested on a Slingshot 11 cluster with synthetic failures (SIGTERM). I'm not sure how consistent the error code will be across different libfabric backends or types of faults. It may...
Do not try the -mcx16 flag if --disable-cx16-atomics is used, and prevent the generation of instructions that are not available on all x86 platforms (such as Celeron N4000). Always try...