gpu-operator icon indicating copy to clipboard operation
gpu-operator copied to clipboard

NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes

Results 392 gpu-operator issues
Sort by recently updated
recently updated
newest added

_The template below is mostly useful for bug reports and support questions. Feel free to remove anything which doesn't apply to you and add more information where it makes sense._...

Hi, we're maintaining an OpenShift v4.10 cluster, and recently provisioned Dell PowerEdge XE9680 servers as GPU nodes. We are working with NVIDIA GPU Operator v22.9.1 as for now (aware of...

_The template below is mostly useful for bug reports and support questions. Feel free to remove anything which doesn't apply to you and add more information where it makes sense._...

### 1. Quick Debug Information * OS/Version(e.g. RHEL8.6, Ubuntu22.04): Ubuntu20.04 * Kernel Version: 5.15.x * Container Runtime Type/Version(e.g. Containerd, CRI-O, Docker): CRI-O * K8s Flavor/Version(e.g. K8s, OCP, Rancher, GKE, EKS):...

### 1. Quick Debug Information * OS/Version: RHEL8.8 * Kernel Version:4.18.0-477.27.1.el8_8.x86_64 * Container Runtime Type/Versio: Containerd * K8s Flavor/Version: v1.26.2 * GPU Operator Version: 23.6.1 ### 2. Issue or feature...

question
more-information-needed

_The template below is mostly useful for bug reports and support questions. Feel free to remove anything which doesn't apply to you and add more information where it makes sense._...

In a Rancher-provisioned bare metal cluster I have two GPU nodes that cannot finish upgrade, their status is _validation-required_ and _pod-restart-required_. ### 1. Quick Debug Information * OS/Version - Ubuntu22.04:...

_The template below is mostly useful for bug reports and support questions. Feel free to remove anything which doesn't apply to you and add more information where it makes sense._...

_The template below is mostly useful for bug reports and support questions. Feel free to remove anything which doesn't apply to you and add more information where it makes sense._...

Steps to reproduce the issue: 1/ Set the RHEL 8 server in SELinux ENFORCING mode: ``` [nvidia@ipp1-0686 ~]$ sestatus SELinux status: enabled SELinuxfs mount: /sys/fs/selinux SELinux root directory: /etc/selinux Loaded...