bertiethorpe
bertiethorpe
new fatimage build: https://github.com/stackhpc/ansible-slurm-appliance/actions/runs/14081400474 (bumped above)
Checks cancelled because don't test changes
https://github.com/stackhpc/ansible-slurm-appliance/actions/runs/10417484239
https://github.com/stackhpc/ansible-slurm-appliance/actions/runs/10417484239/job/28860827536 CI workflow uploads images as RAW, download fails because it takes too long. QCOW2 still blocked? I thought CI uploads were supposed to be a work around for this
https://github.com/stackhpc/ansible-slurm-appliance/actions/runs/10451752138
Need an automated workflow for fat image uploads across all sites.
https://github.com/stackhpc/ansible-slurm-appliance/actions/runs/10700305057
Build: https://github.com/stackhpc/ansible-slurm-appliance/actions/runs/10700649561
Some more information: - This is all virtualised > Run ucx_info -e -u t -P inter with various UCX_NET_DEVICES and check whether the used devices are the ones you expect....
> can you pls run with UCX_NET_DEVICES=eth0 and also add -mca pml_base_verbose 99 -mca pml_ucx_verbose 99 -mca pml ucx to mpirun? [ucxlog.txt](https://github.com/user-attachments/files/16512363/ucxlog.txt)