OpenCoarrays icon indicating copy to clipboard operation
OpenCoarrays copied to clipboard

Defect: Test failures with openmpi-4.1.x: ***Failed Required regular expression not found. Regex=[Test passed.]

Open band-a-prend opened this issue 3 years ago • 1 comments

The title of the issue should start with Defect: followed by a succinct title.

Please make sure to put any logs, terminal output, or code in fenced code blocks. Please also read the contributing guidelines before submitting a new issue.

Please note we will close your issue without comment if you delete, do not read or do not fill out the issue checklist below and provide ALL the requested information.

  • [ x] I am reporting a bug others will be able to reproduce and not asking a question or requesting a new feature.

System information including:

  • OpenCoarrays Version: $ caf --version OpenCoarrays Coarray Fortran Compiler Wrapper (caf version 2.10.1) Copyright (C) 2015-2022 Sourcery Institute Copyright (C) 2015-2022 Archaeologic Inc.

  • Fortran Compiler: gfortran (gcc 11.3.0)

  • C compiler used for building lib: gcc 11.3.0

  • Installation method: Gentoo portage (package manager)

  • All flags & options passed to the installer Build type RelWithDebInfo Install path /usr Compiler flags: C -march=native -O2 -pipe -Werror=odr -Werror=lto-type-mismatch -Werror=strict-aliasing -fno-working-directory -fdebug-prefix-map="/var/tmp/portage/sys -cluster/opencoarrays-2.10.1/work/OpenCoarrays-2.10.1"=. C++
    Linker flags: Executable -Wl,-O1 -Wl,--as-needed Module -Wl,-O1 -Wl,--as-needed Shared -Wl,-O1 -Wl,--as-needed

  • Output of uname -a: Linux 5.15.63-gentoo-dist SMP Thu Aug 25 12:40:44 -00 2022 x86_64 AMD Phenom(tm) II X4 955 Processor AuthenticAMD GNU/Linux

  • MPI library being used: openmpi-4.1.2, openmpi-4.1.4

  • Machine architecture and number of physical cores: x86_64 AMD Phenom(tm) II X4 955, 4 cores

  • Version of CMake: 3.23.3

To help us debug your issue please explain:

What you were trying to do (and why)

Tried to run tests after build against openmpi instead of mpich on attempt to packaging new OpenCoarrays version 2.10.1.

What happened (include command output, screenshots, logs, etc.)

Some tests are failed with error ***Failed Required regular expression not found. Regex=[Test passed.]

The following tests FAILED:
          6 - allocate_as_barrier_proc (Failed)
         10 - async_comp_alloc (Failed)
         15 - team_number (Failed)
         18 - teams_coarray_get (Failed)
         19 - teams_coarray_get_by_ref (Failed)
         20 - teams_coarray_send (Failed)
         21 - teams_coarray_send_by_ref (Failed)
         22 - teams_coarray_sendget (Failed)
         23 - sync_team (Failed)
         53 - syncimages_status (Failed)
         55 - sync_ring_abort_np7 (Failed)
         56 - simpleatomics (Failed)
         57 - syncall (Failed)
         58 - syncimages (Failed)
         59 - syncimages2 (Failed)
         60 - duplicate_syncimages (Failed)
         65 - source-alloc-no-sync (Failed)
         73 - issue-493-coindex-slice (Failed)
         74 - issue-488-multi-dim-cobounds-true (Failed)
         75 - issue-488-multi-dim-cobounds-false (Failed)

Full build and test log

What you expected to happen

All tests are passed like if build and run tests for OpenCoarrays 2.10.1 built against mpich-3.4.3. (Full build and successfully test log if build against mpich )

The OpenCoarrays of 2.9.3 and 2.10.0 successfully pass all test being built against openmpi-4.1.2 and 4.1.4.

Step-by-step reproduction instructions to reproduce the error/bug

  • Install openmpi-4.1.2 or 4.1.4 instead of mpich - for previous versions of OpenCoarrays it was sufficient to install openmpi with next built-in options: sys-cluster/openmpi-4.1.4::gentoo USE="fortran -cma -cuda -cxx -ipv6 -java -libompitrace -peruse -romio" where - means that option is switched off.
  • Build OpenCoarrays 2.10.1
  • Run tests

band-a-prend avatar Oct 13 '22 21:10 band-a-prend

The changes introduced in https://github.com/sourceryinstitute/OpenCoarrays/commit/ed01646a22ad4ac02900516ed6eb6ec1c759f188 results in this problem when openmpi is used.

band-a-prend avatar Oct 14 '22 13:10 band-a-prend