Shiva Krishna Merla comments

Results: 278 comments of Shiva Krishna Merla

nvidia-driver-validation crashloopbackoff

@hbahadorzadeh can you try out `v1.7.0` and check whether the validation pod is still crashing?

nvidia-driver-validation crashloopbackoff

@rhysjtevans I think we verified `460.32.03` on CentOS 8. I will look into the details of the compilation errors soon.

nvidia-driver-validation crashloopbackoff

@rhysjtevans The comment [here](https://github.com/NVIDIA/libnvidia-container/blob/master/src/nvc_container.c#L348) explains why the `.real` file is preferred. But if it is not present, libnvidia-container should fall back to using the `/sbin/ldconfig` file. @elezar to confirm why...
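
For illustration, the prefer-then-fall-back lookup described above could be sketched as follows. This is not libnvidia-container's code (which is C); `pick_ldconfig` is a hypothetical helper, and the `/sbin/ldconfig.real` path is an assumption based on the `.real` suffix mentioned in the comment.

```python
import os

# Hypothetical helper, not part of libnvidia-container.
# Assumption: the preferred binary lives at /sbin/ldconfig.real.
LDCONFIG_CANDIDATES = ["/sbin/ldconfig.real", "/sbin/ldconfig"]


def pick_ldconfig(rootfs: str = "/") -> str:
    """Return the ldconfig path to use inside rootfs, preferring the .real file."""
    for path in LDCONFIG_CANDIDATES:
        # Resolve the candidate relative to the container root filesystem.
        if os.path.isfile(os.path.join(rootfs, path.lstrip("/"))):
            return path
    # Nothing found: fall back to the plain /sbin/ldconfig path.
    return LDCONFIG_CANDIDATES[-1]
```

Calling `pick_ldconfig("/")` on a host shows which of the two paths would be chosen there.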

Nvidia GPU operator failing to install on OpenShift with dedicated rather than shared nodes

@smithbk By default, the gpu-operator pod deployed through OLM doesn't have any specific nodeSelector/tolerations. Did you add the nodeSelector by editing the CSV? Can you get the podSpec for the operator Deployment...

Nvidia GPU operator failing to install on OpenShift with dedicated rather than shared nodes

@smithbk Can you get the taints on the GPU nodes? We probably need to add matching tolerations to the GPU Operator pod. This is done by editing the CSV. `oc get...
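
As a rough sketch of what "adding those tolerations" means, each node taint (`key`, optional `value`, `effect`) needs a toleration with the same key and effect. The helper below is hypothetical and only builds the toleration entries; the taint key `dedicated` and value `gpu` in the usage line are made-up examples, not values from this issue.

```python
def tolerations_for(taints):
    """Build one Kubernetes toleration per taint (illustrative helper)."""
    tolerations = []
    for taint in taints:
        tol = {"key": taint["key"], "effect": taint["effect"]}
        if taint.get("value"):
            # A taint with a value is matched with the Equal operator.
            tol.update(operator="Equal", value=taint["value"])
        else:
            # A taint without a value is matched on key presence alone.
            tol["operator"] = "Exists"
        tolerations.append(tol)
    return tolerations


# Example with a made-up taint:
print(tolerations_for([{"key": "dedicated", "value": "gpu", "effect": "NoSchedule"}]))
```

The resulting entries would go under `tolerations` in the operator pod spec.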

Nvidia GPU operator failing to install on OpenShift with dedicated rather than shared nodes

@smithbk Can you describe the project (`oc describe project nvidia-gpu-operator`) to see whether the annotation was added to pick the nodeSelector? https://docs.openshift.com/container-platform/4.10/nodes/scheduling/nodes-scheduler-taints-tolerations.html#nodes-scheduler-taints-tolerations-projects_nodes-scheduler-taints-tolerations