rcps-buildscripts

Install & Repair Request: ANSYS 2023.R1/2, 2024.R1

Open ikirker opened this issue 10 months ago • 19 comments

Application: ANSYS 2023.R1 and R2

Cluster: Myriad & Kathleen at least

Description: does fluid dynamics and other macroscopic physics simulations

License: proprietary

Ticket number: IN:00000000

ANSYS 2023.R1 is installed but still has problems: the MPI wrappers do not appear to export the environment correctly. As a result, the Fluent binary cannot find libmpi_wrapper.so or libansysfluidshdfsdk.so (or, presumably, any other library it would load after those).

ANSYS 2023.R2 is not currently installed. The same problem is likely, though not certain, to show up in 2023.R2 as well.

To reproduce the problem:

fluent 3d -g -mpitest

You may see Tcl errors, but you should also see this error:

/shared/ucl/apps/ANSYS/2023.R1/v231/fluent/fluent23.1.0/multiport/mpi_wrapper/test/lnamd64/test: error while loading shared libraries: libmpi_wrapper.so: cannot open shared object file: No such file or directory

If you run Fluent without the -mpitest option, the problem manifests as this error instead:

/shared/ucl/apps/ANSYS/2023.R1/v231/fluent/fluent23.1.0/lnamd64/3d_node/fluent_mpi.23.1.0: error while loading shared libraries: libansysfluidshdfsdk.so: cannot open shared object file: No such file or directory
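
One way to confirm which libraries the dynamic loader cannot resolve is to run ldd on the binary named in the error. The following is a minimal sketch: the function name missing_libs is made up for illustration, and the binary path is copied from the error message above. Note that this only reflects the environment of the shell it runs in, which may differ from the environment the MPI wrapper builds for the Fluent processes.

```shell
#!/bin/sh
# Print the names of shared libraries that ldd reports as "not found".
# Reads ldd output on stdin, so it can be checked without the binary present.
# (missing_libs is a hypothetical helper name, not part of ANSYS.)
missing_libs() {
    awk '/=> not found/ { print $1 }'
}

# Path copied from the error message in this issue.
FLUENT_MPI=/shared/ucl/apps/ANSYS/2023.R1/v231/fluent/fluent23.1.0/lnamd64/3d_node/fluent_mpi.23.1.0

if [ -x "$FLUENT_MPI" ]; then
    # Libraries the loader cannot find with the current environment.
    ldd "$FLUENT_MPI" | missing_libs
    # The library search path as seen by this shell, one entry per line.
    printf '%s\n' "${LD_LIBRARY_PATH:-}" | tr ':' '\n'
fi
```

If libansysfluidshdfsdk.so appears in the output here as well, the module environment itself is incomplete; if it does not, the path is probably being lost somewhere between the launcher and the MPI processes.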

Here are the Tcl errors I see. They only appear on Kathleen, possibly because of a difference in the Tcl libraries installed in the OS.

$ fluent 3d -g -mpitest
/shared/ucl/apps/ANSYS/2023.R1/v231/fluent/fluent23.1.0/bin/fluent -r23.1.0 3d -g -mpitest
application-specific initialization failed: Can't find a usable init.tcl in the following directories: 
    /home2/polyflowbuilds/buildagents/milpolybld01/work/1210bc2a0080eb69/Tcl-tk/8_5_11/linx64/Release/tcltk/lib/tcl8.5 /usr/lib/tcl8.5 /lib/tcl8.5 /usr/library /library /tcl8.5.11/library /tcl8.5.11/library



This probably means that Tcl wasn't installed properly.

I think the setup for the MPI environment is done in:

/shared/ucl/apps/ANSYS/2023.R1/v231/CEI/apex231/machines/linux_2.6_64/multiport/mpi_wrapper/bin/mpirun.fl

but I haven't had time to look much further into it yet.
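
As a starting point for that investigation, something like the following could show where the launcher script touches LD_LIBRARY_PATH. This is only a sketch: ld_path_lines is a hypothetical helper name, and the path is the one quoted above. It makes no assumption about whether the script is sh or csh, since it just searches for the variable name.

```shell
#!/bin/sh
# List the lines (with line numbers) of a launcher script that mention
# LD_LIBRARY_PATH. (ld_path_lines is a hypothetical helper name.)
ld_path_lines() {
    grep -n 'LD_LIBRARY_PATH' "$1"
}

# Path quoted in this issue as the likely place the MPI environment is set up.
MPIRUN_FL=/shared/ucl/apps/ANSYS/2023.R1/v231/CEI/apex231/machines/linux_2.6_64/multiport/mpi_wrapper/bin/mpirun.fl

if [ -r "$MPIRUN_FL" ]; then
    ld_path_lines "$MPIRUN_FL" || echo "no LD_LIBRARY_PATH references in $MPIRUN_FL"
fi
```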

ikirker, Oct 20 '23 12:10

@balston noted that it is necessary to use a non-default MPI instead, e.g.:

fluent 3ddp -t$NSLOTS -mpi=intel2018 -cnf=$TMPDIR/machines -g < your-input-file

Is it feasible to change this to be the new default?

The MPI test using this option passes successfully within a single node:

fluent 3d -g -mpi=intel2018 -t 2 -mpitest
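
If changing the default inside the ANSYS install is not practical, one possible stopgap is a small shell function that adds -mpi=intel2018 only when the caller has not chosen an MPI. This is a sketch under assumptions: the function name fluent_default_mpi and the FLUENT_CMD override are invented for illustration (the override exists only so the logic can be exercised without Fluent), and it has not been tried against the real launcher.

```shell
#!/bin/sh
# Run Fluent with -mpi=intel2018 appended unless an -mpi=... option was
# already given. FLUENT_CMD is a hypothetical override used for dry runs;
# it defaults to the real "fluent" command.
fluent_default_mpi() {
    for arg in "$@"; do
        case "$arg" in
            -mpi=*)
                # The caller picked an MPI explicitly: pass everything through.
                "${FLUENT_CMD:-fluent}" "$@"
                return
                ;;
        esac
    done
    "${FLUENT_CMD:-fluent}" "$@" -mpi=intel2018
}
```

For example, with FLUENT_CMD=echo set, calling fluent_default_mpi 3d -g -t 2 -mpitest prints the command line that would be run, with -mpi=intel2018 added at the end.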

ikirker, Oct 20 '23 12:10

ANSYS Mechanical from 2023.R1 doesn't work on Kathleen (and possibly Myriad) as it picks up the wrong version of Tcl (the one from /usr/bin). This causes jobs to fail with errors like:

application-specific initialization failed: Can't find a usable init.tcl in the following directories: 
    /usr/share/tcl8.5 /home2/polyflowbuilds/buildagents/milpolybld01/work/1210bc2a0080eb69/Tcl-tk/8_5_11/linx64/Release/tcltk/lib/tcl8.5 /usr/lib/tcl8.5 /lib/tcl8.5 /usr/library /library /tcl8.5.11/library /tcl8.5.11/library

/usr/share/tcl8.5/init.tcl: version conflict for package "Tcl": have 8.5.11, need exactly 8.5.13
version conflict for package "Tcl": have 8.5.11, need exactly 8.5.13
    while executing
"package require -exact Tcl 8.5.13"
    (file "/usr/share/tcl8.5/init.tcl" line 18)
    invoked from within
"source /usr/share/tcl8.5/init.tcl"
    ("uplevel" body line 1)
    invoked from within
"uplevel #0 [list source $tclfile]"


This probably means that Tcl wasn't installed properly.
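
The traceback shows a Tcl 8.5.11 interpreter loading an init.tcl that insists on exactly 8.5.13, so the interpreter needs to find an init.tcl that matches its own version. A rough way to see which copies exist and which version each one requires is sketched below. The helper name init_tcl_version and the ANSYS_ROOT variable are assumptions for illustration, not something the ANSYS module is known to define. Tcl itself consults the TCL_LIBRARY environment variable before its built-in search path, so pointing that at a matching directory might be a workaround, but whether the ANSYS launchers preserve or override it is untested here.

```shell
#!/bin/sh
# Print the Tcl version an init.tcl file insists on, taken from its
# "package require -exact Tcl X.Y.Z" line.
# (init_tcl_version is a hypothetical helper name.)
init_tcl_version() {
    sed -n 's/^package require -exact Tcl \([0-9.]*\).*/\1/p' "$1" | head -n 1
}

# The system copy named in the traceback above.
SYS_INIT=/usr/share/tcl8.5/init.tcl
if [ -r "$SYS_INIT" ]; then
    echo "$SYS_INIT requires Tcl $(init_tcl_version "$SYS_INIT")"
fi

# ANSYS_ROOT is an assumed variable pointing at the ANSYS install tree.
# List every init.tcl beneath it with the version it requires.
if [ -d "${ANSYS_ROOT:-}" ]; then
    find "$ANSYS_ROOT" -name init.tcl 2>/dev/null | while read -r f; do
        echo "$f requires Tcl $(init_tcl_version "$f")"
    done
fi
```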

balston, Dec 14 '23 11:12

ANSYS 2024.R1 is now available, so it is probably worth attempting to install this version.

balston, Mar 15 '24 11:03