ansible-devops
Can't pull predefined version of NVIDIA GPU driver (nvidia_gpu role)
https://github.com/ibm-mas/ansible-devops/blob/a4c2c3b04e4967bdc5da62b00bec2b0923aa7b33/ibm/mas_devops/roles/nvidia_gpu/defaults/main.yml#L10
I updated the gpu-cluster-policy resource (kind: ClusterPolicy) to use the latest driver, and that fixed it.
I just noticed that RHCOS 4.14 is based on RHEL 9.2: https://access.redhat.com/articles/6907891
Edit: As a workaround, set the GPU_DRIVER_VERSION environment variable to define the driver version manually.
export GPU_DRIVER_VERSION=550.54.14
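For anyone who wants to apply the same version to an existing cluster, here is a rough sketch of the ClusterPolicy change mentioned above. It assumes the driver version lives at spec.driver.version in the NVIDIA GPU Operator's ClusterPolicy and that the resource is named gpu-cluster-policy as in this thread; check both against your cluster before running the commented oc command.

```shell
# Pin the driver version (value taken from this thread).
export GPU_DRIVER_VERSION=550.54.14

# Build a JSON merge patch for the ClusterPolicy.
# Assumption: the version field is spec.driver.version.
PATCH=$(printf '{"spec":{"driver":{"version":"%s"}}}' "$GPU_DRIVER_VERSION")
echo "$PATCH"

# Apply it once you are logged in to the cluster with oc:
# oc patch clusterpolicy gpu-cluster-policy --type merge -p "$PATCH"
```

The patch step is left commented out so the snippet only prints what it would send; uncomment it after confirming the resource name with `oc get clusterpolicy`.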
Looking into this now for you
https://jsw.ibm.com/browse/MASCORE-4342