bertiethorpe

Results 22 comments of bertiethorpe

new fatimage build: https://github.com/stackhpc/ansible-slurm-appliance/actions/runs/14081400474 (bumped above)

https://github.com/stackhpc/ansible-slurm-appliance/actions/runs/10417484239

https://github.com/stackhpc/ansible-slurm-appliance/actions/runs/10417484239/job/28860827536 The CI workflow uploads images as RAW, and the download fails because it takes too long. Is QCOW2 still blocked? I thought CI uploads were supposed to be a workaround for this.

https://github.com/stackhpc/ansible-slurm-appliance/actions/runs/10451752138

Need an automated workflow for fat image uploads across all sites.

https://github.com/stackhpc/ansible-slurm-appliance/actions/runs/10700305057

Build: https://github.com/stackhpc/ansible-slurm-appliance/actions/runs/10700649561

Some more information:
- This is all virtualised

> Run ucx_info -e -u t -P inter with various UCX_NET_DEVICES and check whether the used devices are the ones you expect....
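The check requested in the quote above could be scripted roughly as follows. This is only a sketch: the `ucx_info` flags are copied verbatim from the quote, while the helper names `ucx_info_env` and `run_ucx_info` are hypothetical, and it assumes `ucx_info` is on `PATH` on the node being tested.

```python
import os
import subprocess

# Flags copied verbatim from the quoted request.
UCX_INFO_ARGS = ["ucx_info", "-e", "-u", "t", "-P", "inter"]


def ucx_info_env(device, base_env=None):
    """Return a copy of the environment with UCX_NET_DEVICES set to `device`."""
    env = dict(os.environ if base_env is None else base_env)
    env["UCX_NET_DEVICES"] = device
    return env


def run_ucx_info(device):
    """Run ucx_info restricted to one device and return its combined output.

    Hypothetical helper: requires ucx_info to be installed on this host.
    """
    result = subprocess.run(
        UCX_INFO_ARGS,
        env=ucx_info_env(device),
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,
        text=True,
        check=False,  # keep the output even when ucx_info exits non-zero
    )
    return result.stdout

# Example use on a node (device names depend on the host):
#   for dev in ["eth0"]:
#       print(f"=== UCX_NET_DEVICES={dev} ===")
#       print(run_ucx_info(dev))
```

Comparing the output for each device value shows whether the devices UCX selects are the expected ones.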

> can you pls run with UCX_NET_DEVICES=eth0 and also add -mca pml_base_verbose 99 -mca pml_ucx_verbose 99 -mca pml ucx to mpirun?

[ucxlog.txt](https://github.com/user-attachments/files/16512363/ucxlog.txt)
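For reference, the invocation asked for in the quote can be assembled like this. It is a minimal sketch: the `-mca` options and `UCX_NET_DEVICES=eth0` come from the quote, whereas the helper names and the application path `./my_mpi_app` are placeholders, since the actual command used to produce the log is not shown in the thread.

```python
import shlex

# MCA options copied verbatim from the quoted request.
DEBUG_MCA_ARGS = [
    "-mca", "pml_base_verbose", "99",
    "-mca", "pml_ucx_verbose", "99",
    "-mca", "pml", "ucx",
]


def mpirun_debug_command(app_args, mpirun="mpirun"):
    """Build the mpirun argument list with the verbose UCX PML options added."""
    return [mpirun, *DEBUG_MCA_ARGS, *app_args]


def as_shell_line(device, cmd):
    """Render the command as one shell line with UCX_NET_DEVICES set inline."""
    return f"UCX_NET_DEVICES={shlex.quote(device)} {shlex.join(cmd)}"


# "./my_mpi_app" is a hypothetical application, not one named in the thread.
line = as_shell_line("eth0", mpirun_debug_command(["./my_mpi_app"]))
print(line)
```

Redirecting both stdout and stderr of that line to a file gives a log like the one attached.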