Shiva Krishna Merla
Shiva Krishna Merla
@hassanshabbirahmed @vietkute02 will debug the issue with CentOS7. Meanwhile can you edit the driver daemonset to edit the image to `nvcr.io/nvidia/driver:450.80.02-rhel7.9` and verify if this resolves it?
We are trying to validate this internally and will try out to fix this soon.
@manishdash12 Can you get the output of `oc get clusterversions -o yaml`, we look for the last successful updated version.(i.e state: Completed). ``` history: - completionTime: "2022-03-15T14:44:39Z" image: quay.io/openshift-release-dev/ocp-release@sha256:6a899c54dda6b844bb12a247e324a0f6cde367e880b73ba110c056df6d018032 startedTime:...
@manishdash12 any update on this?
@sricharanrobinsystems we don't support clusters with mixed OS distributions/versions currently. Also, we support only CentOS7. Is this output with CentOS8?
@tmbdev Please install with driver container disabled as you seem to have drivers pre-installed on the node already. `--set driver.enabled=false`. Or use latest versions of operator v1.11.0 where driver container...
Also, note that you don't have to pre-install the drivers in the first place and operator takes care of it. ``` - name: installing CUDA from NVIDIA shell: | cd...
got it, seems like this option needs to be updated with microk8s installs. This problem will not happen with v1.11.0 of operator as it will not try to overwrite drivers...
@rupang790 Driver container will always compile for current running kernel on the host. We cannot edit this, so you would need to use worker nodes for which packages are available....
@prpaul no, we don't support RHEL 8.x worker nodes, but only CoreOS. There is no plan to support RHEL worker nodes in the short term.