Steve Brasier
Cancelled tests after rebase, building image: https://github.com/stackhpc/ansible-slurm-appliance/actions/runs/13333602095 NB: once bumped, this should trigger a rebuild rather than reimage
Note the above is out of date!
> nit: Does the compute module need a test to check that we're not passing in a string or something that it doesn't know how to interpret? That'd be lovely...
Apologies for the slow response. So, this has changed very rapidly over the last couple of months or so, backstory here: https://stackhpc.com/slurm-package-changes.html. In general, the instructions assume you are using...
selinux is not disabled by default, hence this occurs with any unmodified cookiecutter environment regardless of image. See https://github.com/stackhpc/ansible-slurm-appliance/blob/main/environments/common/inventory/group_vars/all/selinux.yml. We do disable it in CI: https://github.com/stackhpc/ansible-slurm-appliance/blob/main/environments/.stackhpc/inventory/group_vars/selinux/overrides.yml
Also a bit horrible that as the stackhpc env has selinux disabled, running site.yml using a default env (with it enabled) on our image means it reboots to change the...
Ok so the (depreciated) cloudalchemy.prometheus role is looking for the wrong packages: ``` TASK [cloudalchemy.prometheus : Gather variables for each operating system] ******************************************************************************************************************************************************** Friday 03 January 2025 11:11:57 +0000 (0:00:00.042)...
Also explains why this works on a client with SELinux enabled (on the control node), because they are on Rocky Linux 8.
Changed in https://github.com/stackhpc/ansible-slurm-appliance/pull/473
@sd109 @bertiethorpe this still has a TODO in the first comment? If this isn't going to be done can you just rephrase this as a limitation please?