rcps-buildscripts icon indicating copy to clipboard operation
rcps-buildscripts copied to clipboard

Install & Repair Request: ANSYS 2023.R1/2, 2024.R1

Open ikirker opened this issue 1 year ago • 19 comments

Application: ANSYS 2023.R1 and R2

Cluster: Myriad & Kathleen at least

Description: does fluid dynamics and other macroscopic physics simulations

License: proprietary

Ticket number: IN:00000000

ANSYS 2023.R1 is installed, but currently has some remaining problems: it seems the MPI wrappers do not correctly export the environment? This is causing errors where the Fluent binary cannot find either libmpi_wrapper.so or libansysfluidshdfsdk.so (or presumably other libraries it would load after that).

ANSYS 2023.R2 is not currently installed. I assume that this problem is likely, but not certain, to show up in 2023.R2 as well.

To reproduce the problem:

fluent 3d -g -mpitest

You may see Tcl errors, but also this error:

/shared/ucl/apps/ANSYS/2023.R1/v231/fluent/fluent23.1.0/multiport/mpi_wrapper/test/lnamd64/test: error while loading shared libraries: libmpi_wrapper.so: cannot open shared object file: No such file or directory

If you run Fluent without the option to test, the problem manifests as this error instead:

/shared/ucl/apps/ANSYS/2023.R1/v231/fluent/fluent23.1.0/lnamd64/3d_node/fluent_mpi.23.1.0: error while loading shared libraries: libansysfluidshdfsdk.so: cannot open shared object file: No such file or directory

Here are the Tcl errors I see, but they only appear on Kathleen; maybe a difference in Tcl libs installed in the OS.

$ fluent 3d -g -mpitest
/shared/ucl/apps/ANSYS/2023.R1/v231/fluent/fluent23.1.0/bin/fluent -r23.1.0 3d -g -mpitest
application-specific initialization failed: Can't find a usable init.tcl in the following directories: 
    /home2/polyflowbuilds/buildagents/milpolybld01/work/1210bc2a0080eb69/Tcl-tk/8_5_11/linx64/Release/tcltk/lib/tcl8.5 /usr/lib/tcl8.5 /lib/tcl8.5 /usr/library /library /tcl8.5.11/library /tcl8.5.11/library



This probably means that Tcl wasn't installed properly.

I think the setup for the MPI environment is done in:

/shared/ucl/apps/ANSYS/2023.R1/v231/CEI/apex231/machines/linux_2.6_64/multiport/mpi_wrapper/bin/mpirun.fl

but I haven't had time to look much further into it yet.

ikirker avatar Oct 20 '23 12:10 ikirker

@balston noted that it is necessary to use a non-default MPI, instead, e.g.:

fluent 3ddp -t$NSLOTS -mpi=intel2018 -cnf=$TMPDIR/machines -g < your-input-file

Is it feasible to change this to be the new default?

The MPI test using this option passes successfully within a single node:

fluent 3d -g -mpi=intel2018 -t 2 -mpitest

ikirker avatar Oct 20 '23 12:10 ikirker

ANSYS Mechanical from 2023.R1 doesn't work on Kathleen (and possibly Myriad) as it picks up the wrong version of Tcl (the one from /usr/bin). This causes jobs to fail with errors like:

application-specific initialization failed: Can't find a usable init.tcl in the following directories: 
    /usr/share/tcl8.5 /home2/polyflowbuilds/buildagents/milpolybld01/work/1210bc2a0080eb69/Tcl-tk/8_5_11/linx64/Release/tcltk/lib/tcl8.5 /usr/lib/tcl8.5 /lib/tcl8.5 /usr/library /library /tcl8.5.11/library /tcl8.5.11/library

/usr/share/tcl8.5/init.tcl: version conflict for package "Tcl": have 8.5.11, need exactly 8.5.13
version conflict for package "Tcl": have 8.5.11, need exactly 8.5.13
    while executing
"package require -exact Tcl 8.5.13"
    (file "/usr/share/tcl8.5/init.tcl" line 18)
    invoked from within
"source /usr/share/tcl8.5/init.tcl"
    ("uplevel" body line 1)
    invoked from within
"uplevel #0 [list source $tclfile]"


This probably means that Tcl wasn't installed properly.

balston avatar Dec 14 '23 11:12 balston

ANSYS 2024.R1 is now available so probably worth attempting to install this version.

balston avatar Mar 15 '24 11:03 balston

There is a request to install ANSYS 2024 R1 on Kathleen (IN06665389).

cdkharris avatar Jun 11 '24 13:06 cdkharris

This should be possible. I'll download the ANSYS 2024 installers from behind the license portal and upload them to Kathleen.

balston avatar Jun 12 '24 11:06 balston

I've downloaded the ANSYS 2024.R1 installer archives to my laptop (all 45 GB worth!), done checksum checks and will upload them to Kathleen tomorrow.

balston avatar Jun 12 '24 17:06 balston

Thew ANSYS 2024.R1 installer archives are now in:

/shared/ucl/apps/ANSYS/installers/2024.R1

on Kathleen and their checksums have been checked against the ANSYS Inc published ones.

balston avatar Jun 14 '24 11:06 balston

I have the ANSYS 2024 R1 install script ready to run on Kathleen.

balston avatar Jun 18 '24 10:06 balston

The install script is running as ccspapp:

./ANSYS-2024R1_install 2>&1 | tee ~/Software/ANSYS/ANSYS-2024R1_install.log-1

and its running really slowly.

balston avatar Jun 18 '24 13:06 balston

The install hung during the ANSYS Electromagnetics installer yesterday so I'm re-running from this one now.

balston avatar Jun 19 '24 11:06 balston

The ANSYS Electromagnetics installer ran to completion yesterday afternoon and I've now run the additional update install as well. I will now add a module file and run some test jobs.

balston avatar Jun 20 '24 12:06 balston

my ANSYS CFX test job has run successfully over 80 cores at the second attempt. Submitting a Fluent test job next.

balston avatar Jun 20 '24 15:06 balston

Fluent test job ran successfully overnight. Now submitting a AnsysEM test job.

balston avatar Jun 21 '24 09:06 balston

my AnsysEM test job has run successfully

balston avatar Jun 21 '24 11:06 balston

I've uploaded the module file and replied to IN06665389

balston avatar Jun 21 '24 14:06 balston

Will need to install on Myriad as well.

balston avatar Jun 21 '24 14:06 balston

ANSYS 2024 R1 now installed my Myriad. Need to test next.

balston avatar Jul 02 '24 12:07 balston

CFX and Fluent test jobs run successfully on Myriad with 2024 R1

balston avatar Jul 03 '24 12:07 balston

ANSYS Mechanical test job failing with:

OMP: Error #100: Fatal system error detected.
OMP: System error #22: Invalid argument
forrtl: error (76): Abort trap signal

suggestions online suggest switch to an alternative MPI. ANSYS now provide an OpenMPI so will try that.

balston avatar Jul 03 '24 16:07 balston