pika icon indicating copy to clipboard operation
pika copied to clipboard

Several tests fail on FreeBSD

Open yurivict opened this issue 1 year ago • 8 comments

Actual Behavior

log

Steps to Reproduce the Problem

Running tests according to the docs.

  • pika version (commit/tag): 0.21.0
  • Platform (OS, compiler, etc.): FreeBSD 13.2, clang-16

yurivict avatar Dec 11 '23 21:12 yurivict

Could you please run ctest --output-on-failure? tests.unit.modules.runtime.process_mask_flag is expected to fail if it's not run on a particular type of system (with two hyperthreads per core, and we've been lazy in skipping it if the hardware isn't as expected). The others may be real, system specific problems. Any help you can provide in debugging them would be appreciated though.

msimberg avatar Dec 12 '23 09:12 msimberg

Here is the log for the run of ctest --output-on-failure.

tests.unit.modules.runtime.process_mask_flag is expected to fail if it's not run on a particular type of system (with two hyperthreads per core, and we've been lazy in skipping it if the hardware isn't as expected).

This run is on the system with two hyperthreads per core.

yurivict avatar Dec 12 '23 10:12 yurivict

This run is on the system with two hyperthreads per core.

Interesting... thank you. I'll have a look through the log.

msimberg avatar Dec 12 '23 11:12 msimberg

@yurivict are you able to provide lstopo output from the system you're testing on? As for the segfaults, it's hard to say what might be going wrong there. If you're able to provide backtraces from the failing tests that may help.

msimberg avatar Dec 19 '23 10:12 msimberg

$ sudo lstopo
Failed to initialize LevelZero in ze_init(): 0x78000001
Machine (23GB total)
  Package L#0
    NUMANode L#0 (P#0 23GB)
    L3 L#0 (8192KB)
      L2 L#0 (256KB) + L1d L#0 (32KB) + L1i L#0 (32KB) + Core L#0
        PU L#0 (P#0)
        PU L#1 (P#1)
      L2 L#1 (256KB) + L1d L#1 (32KB) + L1i L#1 (32KB) + Core L#1
        PU L#2 (P#2)
        PU L#3 (P#3)
      L2 L#2 (256KB) + L1d L#2 (32KB) + L1i L#2 (32KB) + Core L#2
        PU L#4 (P#4)
        PU L#5 (P#5)
      L2 L#3 (256KB) + L1d L#3 (32KB) + L1i L#3 (32KB) + Core L#3
        PU L#6 (P#6)
        PU L#7 (P#7)
  HostBridge
    PCIBridge
      PCI 01:00.0 (IDE)
    PCIBridge
      PCI 03:00.0 (VGA)
    PCIBridge
      PCI 06:00.0 (Ethernet)
    PCIBridge
      PCI 05:00.0 (Ethernet)
    PCI 00:1f.2 (IDE)
    PCI 00:1f.5 (IDE)

yurivict avatar Jun 13 '24 04:06 yurivict

Thanks @yurivict. I think https://github.com/pika-org/pika/pull/1130/commits/3a999ce6c21fb9bacc34562d29a6628d19c98f49 (not yet merged) would also fix that test failure (i.e. further relax what we expect from the output there). We'll try to get that merged in the next weeks.

Do you think the remaining segfaults are related to your fix in #1171?

msimberg avatar Jun 13 '24 06:06 msimberg

Do you think the remaining segfaults are related to your fix in https://github.com/pika-org/pika/pull/1171?

No, because I ran the patched code.

yurivict avatar Jun 13 '24 06:06 yurivict

Ok, thank you in any case. If you find out anything more in the future do let us know.

msimberg avatar Jun 13 '24 07:06 msimberg