gpu-operator icon indicating copy to clipboard operation
gpu-operator copied to clipboard

NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes

Results 392 gpu-operator issues
Sort by recently updated
recently updated
newest added

_The template below is mostly useful for bug reports and support questions. Feel free to remove anything which doesn't apply to you and add more information where it makes sense._...

### 1. Quick Debug Information * OS/Version(e.g. RHEL8.6, Ubuntu22.04): Ubuntu20.04 * Kernel Version: 5.15.0-89-generic * Container Runtime Type/Version(e.g. Containerd, CRI-O, Docker): Docker * K8s Flavor/Version(e.g. K8s, OCP, Rancher, GKE, EKS):...

### 1. Quick Debug Checklist - [ ] Are you running on an Ubuntu 18.04 node? - [x] Are you running Kubernetes v1.13+? - [x] Are you running Docker (>=...

_The template below is mostly useful for bug reports and support questions. Feel free to remove anything which doesn't apply to you and add more information where it makes sense._...

### 1. Quick Debug Checklist - Run on OpenShift v4.10.16 - GPU operator version v22.9.1 ### 2. Issue or feature description The GPU operator currently arrives with 2 PrometheusRule objects:...

### 1. Quick Debug Information * OS/Version: Rhel 8.8 * Kernel Version: 4.18.0-477.15.1.el8_8.x86_64 * Container Runtime Type/Version(e.g. Containerd, CRI-O, Docker): Containerd * K8s Flavor/Version(e.g. K8s, OCP, Rancher, GKE, EKS): k8s...

### 1. Quick Debug Information * OS/Version(e.g. RHEL8.6, Ubuntu22.04): Ubuntu22.04 * Kernel Version: 5.15.0-1048-gke * Container Runtime Type/Version(e.g. Containerd, CRI-O, Docker): Containerd * K8s Flavor/Version(e.g. K8s, OCP, Rancher, GKE, EKS):...

needs-triage

### Symptoms After the upgrade to v23.6.0 of the operator all deployments are stuck because the `nvidia-container-toolkit-daemonset` DaemonSet stays in the `Init:0/1` phase indefinitely. The logs of the init stage...

_The template below is mostly useful for bug reports and support questions. Feel free to remove anything which doesn't apply to you and add more information where it makes sense._...

_The template below is mostly useful for bug reports and support questions. Feel free to remove anything which doesn't apply to you and add more information where it makes sense._...