adflow icon indicating copy to clipboard operation
adflow copied to clipboard

Testflo default NUM_PROCS causes tests to fail on centos with Intel compilers

Open joanibal opened this issue 5 years ago • 1 comments

Description

When the tests are run on Intel with testflo -v . or with testflo -v -n 5 (or greater) the tests encounter a bus error

Caught signal number 7 BUS: Bus Error, possibly illegal memory access

It is not well understood why this issue occurs, but it can easily be avoid by passing testflo the arguement -n followed by a number less than or equal to 4 (this maybe greater if your machine has more than 4 cores).

Steps to reproduce issue

Please provide a minimum working example (MWE) if possible

  1. run lastest centos intel docker image
  2. navigate to adflow folder and download the test input files
  3. run the tests with testflo .

Code versions

List versions only if relevant

  • Python 3
  • ADflow 2.2.1

joanibal avatar Oct 13 '20 21:10 joanibal

To add on to this, it appears to only affect the intel/impi build when testflo is oversubscribing. I cannot replicate this locally with my gcc build.

ewu63 avatar Oct 14 '20 14:10 ewu63