Evan Lezar

Results: 420 comments by Evan Lezar

@senshreyank could you please open a NEW issue with your problem. Please also provide the versions of the components that you are using.

Could you confirm that you're able to run `nvidia-smi` in the Kind worker node?

Since I don't have k0s experience, let's start out assuming that your goal is to install the GPU Operator in a Kind cluster with GPU support. This involves two stages:...

For "reasons" we were injecting the `/usr/bin/nvidia-ctk` binary from the host into the container for the `k8s-dra-driver`. This is what is causing: ``` dpkg: error processing archive /var/cache/apt/archives/nvidia-container-toolkit-base_1.15.0~rc.1-1_amd64.deb (--unpack): unable...

@joshuacox is containerd in the Kind node configured to use the `nvidia` runtime? In addition, if you don't set it to be the default, you will have to add a...

This issue is probably due to the symlink creation not working under Kind. Please update the environment for the validator in the ClusterPolicy to disable the creation of symlinks as...

You should be able to force containerd by setting the `RUNTIME` envvar (see https://github.com/NVIDIA/nvidia-container-toolkit/blob/1ddc859700c0d698f7f155fdbf7ae6f77ea0c1f5/tools/container/nvidia-toolkit/run.go#L78). I'm not sure why Docker is being detected by the operator. Which version are you installing?

@jmikedupont2 this is internal tooling for setting up testing infrastructure. Are you able to skip this submodule?

Hi @DevKyleS. We are aware of this issue. For the time being, please update the cluster policy and add:
```
- name: DISABLE_DEV_CHAR_SYMLINK_CREATION
  value: "true"
```
to the `validator.driver.env`. cc...
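
For context, the workaround in the comment above could be placed in a ClusterPolicy roughly as sketched below. This is a minimal fragment, not a complete manifest. The resource name `cluster-policy` is an assumption (it may differ in your installation), and only the `validator.driver.env` path and the variable itself come from the comment.

```yaml
# Sketch of a ClusterPolicy fragment; all other spec fields are omitted.
apiVersion: nvidia.com/v1
kind: ClusterPolicy
metadata:
  name: cluster-policy   # assumed name; check your cluster for the actual one
spec:
  validator:
    driver:
      env:
        # Disables creation of the /dev/char symlinks by the driver validator.
        - name: DISABLE_DEV_CHAR_SYMLINK_CREATION
          value: "true"
```

The value is quoted because environment variable values in Kubernetes manifests must be strings, not YAML booleans.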

@cdesiniotis @shivamerla any thoughts on this? Would exposing this as a config option (possibly changing the default back to the previous value) make sense? @heilerich do you have logs available...