gpu-operator
gpu-operator copied to clipboard
NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes
The applyDriverAutoUpgradeAnnotation() function was applying the nvidia.com/gpu-driver-upgrade-enabled annotation to GPU nodes even when driver.enabled=false. This occurred because the function only checked if driver.upgradePolicy.autoUpgrade was true, without verifying that the driver...
Hi folks, We are using 4.18.21 in our Openshift environment. When we installed the Nvidia operator and used Nvidia driver version 535.261.03, there was no problem. We installed the new...
This PR contains the following updates: | Package | Update | Change | |---|---|---| | [nvcr.io/nvidia/cloud-native/k8s-kata-manager](https://gitlab-master.nvidia.com/gfn/security/container-hardening/distroless-sample-projects) ([source](https://redirect.github.com/)) | patch | `v0.2.3` -> `v0.2.4` | --- > [!WARNING] > Some dependencies...
This PR contains the following updates: | Package | Update | Change | |---|---|---| | nvcr.io/nvidia/k8s/dcgm-exporter | digest | `7c0ac44` -> `a1d9137` | --- > [!WARNING] > Some dependencies could...
This PR contains the following updates: | Package | Update | Change | |---|---|---| | nvcr.io/nvidia/k8s/container-toolkit | digest | `ef2a0ef` -> `0524041` | --- > [!WARNING] > Some dependencies could...
This PR contains the following updates: | Package | Update | Change | |---|---|---| | nvcr.io/nvidia/k8s-device-plugin | digest | `50ac011` -> `964847c` | --- > [!WARNING] > Some dependencies could...
Bumps [github.com/prometheus-operator/prometheus-operator/pkg/apis/monitoring](https://github.com/prometheus-operator/prometheus-operator) from 0.83.0 to 0.87.1. Release notes Sourced from github.com/prometheus-operator/prometheus-operator/pkg/apis/monitoring's releases. 0.87.1 / 2025-12-10 [BUGFIX] Fix the generated Alertmanager configuration for html and monospace fields of pushoverConfig receiver in...
hi ,i install vgpu host driver for ubuntu 22.04 lts on physical host (nvidia-vgpu-ubuntu-580_580.105.06_amd64.deb) , my k8s cluster gpu-operator(vgpu model) https://docs.nvidia.com/datacenter/cloud-native/gpu-operator/25.3.4/install-gpu-operator-vgpu.html after installed, pod crash always ``` root@kf1:~# kubectl get...
## Description Add a PR template to ensure contributors document how changes were tested and for general documentation purposes. ## Context (optional) We currently have no PR template. This makes...
- Updated README to document new vGPU license visibility features, including annotations for license status and a condition in ClusterPolicy. - Introduced license collection logic in `license_collector.go` and corresponding tests...