dwschulze

Results 16 comments of dwschulze

I can confirm that the plugin works with docker 1.19.03 with the nvidia-container-runtime installed and does not need nvidia-docker2.

That file doesn't exist (on nodes or master) so it looks like the plugin removed it: $ sudo ls -l /var/lib/kubelet/device-plugins/ total 4 -rw-r--r-- 1 root root 0 Jun 8...

I think we solved this in the discussion of issue #176. Creating / deleting the daemonset creates / deletes the /var/lib/kubelet/device-plugins/nvidia.sock file. The Run Locally instructions probably require that haven’t...

Running `kubectl describe nodes` shows that I have 0 gpus allocatable, which would explain why my pods don't get out of pending state. The plugin is not recognizing my gpus.

Did you mean `nvidia-docker run nvidia/cuda nvidia-smi` ? ``` $ docker run nvidia/cuda nvidia-smi docker: Error response from daemon: OCI runtime create failed: container_linux.go:349: starting container process caused "exec: \"nvidia-smi\":...

The directory /etc/docker got overwritten when I was reinstalling things this morning. I made the changes you showed in /etc/docker/daemon.json, restarted kubelet on the nodes and created the daemonset again...

Restarting docker allows the docker command to run, and kubectl describe nodes shows 1 gpu Allocatable. When I try to run the examples from this page: https://docs.nvidia.com/datacenter/kubernetes/kubernetes-upstream/index.html (have to clone...

That one fails to deploy. I’ve cloned the repo and checkout the examples branch and when I run deployment.yml from my file system it creates a deployment with 32 replicas...

Two things I've noticed. Creating/deleting the plugin daemonset creates / deletes the file /var/lib/kubelet/device-plugins/nvidia.sock on the nodes. The [nvidia-device-plugin.yml](https://raw.githubusercontent.com/NVIDIA/k8s-device-plugin/1.0.0-beta6/nvidia-device-plugin.yml) contains ``` containers: - image: nvidia/k8s-device-plugin:1.0.0-beta6 ``` There is no nvidia/k8s-device-plugin...

> There shouldn't be any plugins on your master -- it only runs on worker nodes. Oh, they're on the nodes: nvidia/k8s-device-plugin 1.0.0-beta6 c0fa7866a301 6 weeks ago 64.2MB Also, which...