gpu-operator

NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes

Results 392 gpu-operator issues

Hello, are nvidia-dcgm and nvidia-dcgm-exporter expected to work in a strictly IPv6 environment? Thanks, Babak. gpu-operator version 1.8.1. Failure with nvidia-dcgm-exporter: [sysadmin@controller-0 ~(keystone_admin)]$ kubectl logs -n gpu-operator-resources nvidia-dcgm-exporter-nnmpn time="2021-11-30T15:37:19Z" level=info...

I run a rapidsai container with jupyter notebook. When I freshly start the container all is fine. I can run some GPU workload inside the notebook. ``` Thu Oct 14...

### 1. Issue or feature description I have created four clusters and installed GPU operator into all 4. Each cluster contains one node which has been given 1 of 8...

Today we updated the GPU operator in one of our OpenShift clusters (version 4.6.35) to version 1.7.1. The upgrade included uninstalling the old GPU operator version and installing everything...

Hello, are there any plans for the NVIDIA GPU Operator to support zLinux? Thanks, Scott

_The template below is mostly useful for bug reports and support questions. Feel free to remove anything which doesn't apply to you and add more information where it makes sense._...

### 1. Quick Debug Checklist - [x] Are you running on an Ubuntu 18.04 node? - [x] Are you running Kubernetes v1.13+? - [x] Are you running Docker (>= 18.06)...

We installed the GPU operator version 1.7.1 in one of our OCP 4.7.16 clusters. We are facing 2 issues with our current deployment: - the driver daemonset does not include...

As of driver nvcr.io/nvidia/driver:460.73.01-ubuntu20.04, the compat32 option does not work because all libs are taken from the container's ld.so.cache. However, the container does not instruct the linker to also search /usr/lib32,...

Hi, I'm using Ubuntu 20.04 (kernel 5.4.0-62) and the 460.32.03 NVIDIA driver image. My GPU is a 1660 Ti. When I install the operator, the nvidia-driver-daemonset pod goes to the running state and its...