Steve Brasier

Results 155 comments of Steve Brasier

@bertiethorpe we'll also need an appropriate `environment/.stackhpc/SMS.pkrvars.hcl` file to define the same stuff for packer builds on sms-labs. Unlike the existing ones it can't use a floating IP, it'll need...

Actually - we need images in SMSlabs to test this anyway :cry:

I cancelled the tests b/c the current PR changes don't actually affect the CI environment

@sd109 I'm still interested in getting this in, see comment above.

@sd109 > Do we have a plan yet for how to get nodes to rejoin the k3s cluster after a compute-init driven rebuild? Is it even possible with this bootstrap...

Hum that's nasty. Do you know **why** you can't "query intf ib0"?? I guess the weak option is to create a dropin for the lnet.service which retries, I'm not sure...

Interesting, thanks. I think this has only ever been used on RL8 and to be honest I've been thinking we should remove support for IPA server entirely as it is...

@wtripp180901 not a high priority but would be nice to know if this PR reduces the size of the data in the image. And/or whether we can reduce the required...

@wtripp180901 when we get back to this we should look at whether https://github.com/stackhpc/ansible-slurm-appliance/blob/main/environments/common/inventory/group_vars/all/systemd.yml needs to be modified.

@wtripp180901 also re TASK [systemd : Add dropins for unit files]: - its a bit worrying that this is a change when runnign site.yml: ``` Wednesday 22 January 2025 16:54:02...