moose icon indicating copy to clipboard operation
moose copied to clipboard

Introduce OpenMPI Conda Packages

Open milljm opened this issue 1 year ago • 3 comments

Reason

MPICH 4.1.x seems to be a no go for Apple Si. When we try to bump MPICH from 4.0.2 (working), to anything newer (4.1.x or higher), we see random and sometimes not-so-random hangs when running MOOSE based applications. I am not exactly sure what causes the hang, but we believe it is occurring in MUMPS.

Design

Adding OpenMPI as a possible solution; Core MOOSE developers are not sure how to make MPICH work. Whether or not this gets accepted, I want a PR Conda channel to play with.

Will need to figure out versioner to begin tracking another wrapper, and everything that might entail.

Possibly create a moose-mpi package instead, allowing for variants: conda install moose-dev openmpi would get you the OpenMPI stack variant, while conda install moose-dev mpich would get you MPICH.

The default (conda install moose-dev) would end up being whatever latest packages Conda finds (eventually resulting in only OpenMPI).

The variant idea is partially working when using the custom channel:

conda config --add channels https://conda.software.inl.gov/moose/dualmpi

Apple Si only at the moment

Impact

Switch MPI Wrapper from MPICH to OpenMPI

milljm avatar Feb 19 '24 17:02 milljm

The hangs are definitely in MUMPS

lindsayad avatar Feb 19 '24 18:02 lindsayad

I have my work cut out for me... This change will require a substantial re-write on our versioner.py tool. At a glance anyway.

milljm avatar Feb 20 '24 18:02 milljm

Now to mess with Civet recipes live...

milljm avatar Feb 20 '24 23:02 milljm