gpu-operator
gpu-operator copied to clipboard
NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes
current in the helm chart values.yaml define many k8s-driver-manager image, but version unlikeness, so we can define a unified field and unified version?
Bumps [golang.org/x/net](https://github.com/golang/net) from 0.10.0 to 0.17.0. Commits b225e7c http2: limit maximum handler goroutines to MaxConcurrentStreams 88194ad go.mod: update golang.org/x dependencies 2b60a61 quic: fix several bugs in flow control accounting 73d82ef...
Bumps [golang.org/x/crypto](https://github.com/golang/crypto) from 0.14.0 to 0.17.0. Commits 9d2ee97 ssh: implement strict KEX protocol changes 4e5a261 ssh: close net.Conn on all NewServerConn errors 152cdb1 x509roots/fallback: update bundle fdfe1f8 ssh: defer channel...
Bumps [github.com/containerd/containerd](https://github.com/containerd/containerd) from 1.7.6 to 1.7.11. Release notes Sourced from github.com/containerd/containerd's releases. containerd 1.7.11 Welcome to the v1.7.11 release of containerd! The eleventh patch release for containerd 1.7 contains various...
### 1. Quick Debug Information * OS/Version(e.g. RHEL8.6, Ubuntu22.04): Centos7.9 * Kernel Version: Linux a800-master 3.10.0-1160.95.1.el7.x86_64 SMP Mon Jul 24 13:59:37 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux * Container Runtime...
I have deployed gpu-operator on a rke2 cluster with the following values.yaml ``` nfd: enabled: true mig: strategy: single psp: enabled: false driver: enabled: true repository: nvcr.io/nvidia version: "525.60.13" rdma:...
### 1. Quick Debug Information * OS/Version(e.g. RHEL8.6, Ubuntu22.04): COS/Ubuntu * Kernel Version: * Container Runtime Type/Version(e.g. Containerd, CRI-O, Docker): Containerd * K8s Flavor/Version(e.g. K8s, OCP, Rancher, GKE, EKS): GKE...
When following the steps defined at: https://docs.nvidia.com/datacenter/cloud-native/gpu-operator/openshift/install-gpu-ocp.html#create-the-clusterpolicy-instance I don't ultimately get an entitlement free build system. I do see the driver container ``` Tylers-MacBook-Pro:~ tylerlisowski$ oc get imagestream -n openshift...
### 1. Issue or feature description When following the quickstart I end up with this error in `k describe po -n gpu-operator gpu-feature-discovery-6tk4h` Warning FailedCreatePodSandBox 0s (x5 over 49s) kubelet...
### 1. Quick Debug Information * OS/Version(e.g. RHEL8.6, Ubuntu22.04): RHEL 8.4 (Ootpa) * Kernel Version: 4.18.0-305.88.1.el8_4.x86_64 * Container Runtime Type/Version(e.g. Containerd, CRI-O, Docker): Containerd * K8s Flavor/Version(e.g. K8s, OCP, Rancher,...