software-layer icon indicating copy to clipboard operation
software-layer copied to clipboard

disable using `x86_64/amd/zen3` installations when `x86_64/amd/zen4` is detected

Open boegel opened this issue 1 year ago • 4 comments

This makes sense now, since we've caught up in x86_64/amd/zen4, except for installations with the 2022b generation of easyconfigs which are too old for AMD Genoa.

Next to this change, we should also update our Lmod hook to print a clear error why those modules are not available for zen4?

boegel avatar Sep 30 '24 14:09 boegel

Instance eessi-bot-mc-aws is configured to build for:

  • architectures: x86_64/generic, x86_64/intel/haswell, x86_64/intel/skylake_avx512, x86_64/amd/zen2, x86_64/amd/zen3, aarch64/generic, aarch64/neoverse_n1, aarch64/neoverse_v1
  • repositories: eessi.io-2023.06-compat, eessi-hpc.org-2023.06-software, eessi-hpc.org-2023.06-compat, eessi.io-2023.06-software

eessi-bot[bot] avatar Sep 30 '24 14:09 eessi-bot[bot]

Instance eessi-bot-mc-azure is configured to build for:

  • architectures: x86_64/amd/zen4
  • repositories: eessi-hpc.org-2023.06-software, eessi-hpc.org-2023.06-compat, eessi.io-2023.06-software, eessi.io-2023.06-compat

eessi-bot[bot] avatar Sep 30 '24 14:09 eessi-bot[bot]

Instance boegel-bot-deucalion is configured to build for:

  • architectures: aarch64/a64fx
  • repositories: eessi.io-2023.06-software

I assume this has to wait for https://gitlab.com/eessi/support/-/issues/37 to have the symlinking for 2022b resolved? Should we put this on draft until then to avoid accidental merge?

casparvl avatar Oct 07 '24 14:10 casparvl

@bedroge This should be good to go too now?

boegel avatar Jan 25 '25 08:01 boegel

@bedroge This should be good to go too now?

In the gitlab issue you mentioned that you were also going to change the modulefile in this PR?

bedroge avatar Jan 25 '25 09:01 bedroge

@bedroge This should be good to go too now?

In the gitlab issue you mentioned that you were also going to change the modulefile in this PR?

Ah, yes, indeed, let's do that here... I'll update the PR

boegel avatar Jan 25 '25 09:01 boegel

bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen4

bedroge avatar Jan 25 '25 10:01 bedroge

Updates by the bot instance eessi-bot-mc-aws (click for details)
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen4 from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4 resulted in:

    • no jobs were submitted

eessi-bot[bot] avatar Jan 25 '25 10:01 eessi-bot[bot]

Updates by the bot instance eessi-bot-mc-azure (click for details)
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen4 from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4 resulted in:

    • submitted job 67, for details & status see https://github.com/EESSI/software-layer/pull/766#issuecomment-2613916336

eessi-bot[bot] avatar Jan 25 '25 10:01 eessi-bot[bot]

New job on instance eessi-bot-mc-azure for CPU micro-architecture x86_64-amd-zen4 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2025.01/pr_766/67

date job status comment
Jan 25 10:20:33 UTC 2025 submitted job id 67 awaits release by job manager
Jan 25 10:20:41 UTC 2025 released job awaits launch by Slurm scheduler
Jan 25 10:25:44 UTC 2025 running job 67 is running
Jan 25 10:33:55 UTC 2025 finished
:grin: SUCCESS (click triangle for details)
Details
:white_check_mark: job output file slurm-67.out
:white_check_mark: no message matching FATAL:
:white_check_mark: no message matching ERROR:
:white_check_mark: no message matching FAILED:
:white_check_mark: no message matching required modules missing:
:white_check_mark: found message(s) matching No missing installations
:white_check_mark: found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen4-1737800799.tar.gzsize: 0 MiB (5005 bytes)
entries: 2
modules under 2023.06/software/linux/x86_64/amd/zen4/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/amd/zen4/software
no software packages in tarball
other under 2023.06/software/linux/x86_64/amd/zen4
2023.06/init/eessi_environment_variables
2023.06/init/modules/EESSI/2023.06.lua
Jan 25 10:33:55 UTC 2025 test result
:grin: SUCCESS (click triangle for details)
ReFrame Summary
[ OK ] ( 1/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/29Aug2024-foss-2023b-kokkos %scale=1_node /aeb2d9df @BotBuildTests:x86-64-amd-zen4-node+default
P: perf: 1806.546 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 2/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/2Aug2023_update2-foss-2023a-kokkos %scale=1_node /04ff9ece @BotBuildTests:x86-64-amd-zen4-node+default
P: perf: 1780.187 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 3/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node %device_type=cpu /775175bf @BotBuildTests:x86-64-amd-zen4-node+default
P: latency: 4.07 us (r:0, l:None, u:None)
[ OK ] ( 4/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node %device_type=cpu /52707c40 @BotBuildTests:x86-64-amd-zen4-node+default
P: latency: 4.37 us (r:0, l:None, u:None)
[ OK ] ( 5/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node %device_type=cpu /b1aacda9 @BotBuildTests:x86-64-amd-zen4-node+default
P: latency: 11.21 us (r:0, l:None, u:None)
[ OK ] ( 6/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node %device_type=cpu /c6bad193 @BotBuildTests:x86-64-amd-zen4-node+default
P: latency: 10.87 us (r:0, l:None, u:None)
[ OK ] ( 7/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /15cad6c4 @BotBuildTests:x86-64-amd-zen4-node+default
P: latency: 0.53 us (r:0, l:None, u:None)
[ OK ] ( 8/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /6672deda @BotBuildTests:x86-64-amd-zen4-node+default
P: latency: 0.55 us (r:0, l:None, u:None)
[ OK ] ( 9/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /2a9a47b1 @BotBuildTests:x86-64-amd-zen4-node+default
P: bandwidth: 44751.4 MB/s (r:0, l:None, u:None)
[ OK ] (10/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /1b24ab8e @BotBuildTests:x86-64-amd-zen4-node+default
P: bandwidth: 47720.11 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 10/10 test case(s) from 10 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
:white_check_mark: job output file slurm-67.out
:white_check_mark: no message matching ERROR:
:white_check_mark: no message matching [\s*FAILED\s*].*Ran .* test case
Jan 25 10:59:20 UTC 2025 uploaded transfer of eessi-2023.06-software-linux-x86_64-amd-zen4-1737800799.tar.gz to S3 bucket succeeded

eessi-bot[bot] avatar Jan 25 '25 10:01 eessi-bot[bot]

Tarball has been ingested.

bedroge avatar Jan 25 '25 14:01 bedroge

PR merged! Moved [] to /project/def-users/SHARED/trash_bin/EESSI/software-layer/2025.01.25

eessi-bot[bot] avatar Jan 25 '25 14:01 eessi-bot[bot]

PR merged! Moved ['/project/def-users/SHARED/jobs/2025.01/pr_766/67'] to /project/def-users/SHARED/trash_bin/EESSI/software-layer/2025.01.25

eessi-bot[bot] avatar Jan 25 '25 14:01 eessi-bot[bot]