Steve Brasier
Steve Brasier
@bertiethorpe we'll also need an appropriate `environment/.stackhpc/SMS.pkrvars.hcl` file to define the same stuff for packer builds on sms-labs. Unlike the existing ones it can't use a floating IP, it'll need...
Actually - we need images in SMSlabs to test this anyway :cry:
I cancelled the tests b/c the current PR changes don't actually affect the CI environment
@sd109 I'm still interested in getting this in, see comment above.
@sd109 > Do we have a plan yet for how to get nodes to rejoin the k3s cluster after a compute-init driven rebuild? Is it even possible with this bootstrap...
Hum that's nasty. Do you know **why** you can't "query intf ib0"?? I guess the weak option is to create a dropin for the lnet.service which retries, I'm not sure...
Interesting, thanks. I think this has only ever been used on RL8 and to be honest I've been thinking we should remove support for IPA server entirely as it is...
@wtripp180901 not a high priority but would be nice to know if this PR reduces the size of the data in the image. And/or whether we can reduce the required...
@wtripp180901 when we get back to this we should look at whether https://github.com/stackhpc/ansible-slurm-appliance/blob/main/environments/common/inventory/group_vars/all/systemd.yml needs to be modified.
@wtripp180901 also re TASK [systemd : Add dropins for unit files]: - its a bit worrying that this is a change when runnign site.yml: ``` Wednesday 22 January 2025 16:54:02...