HAMi icon indicating copy to clipboard operation
HAMi copied to clipboard

if use gpu-operator install nvidia driver, when node restart, driver-plugin start CrashLoopBackOff

Open lengrongfu opened this issue 1 year ago • 3 comments

  1. when the node reboots after, gpu-operator can reinstall NVIDIA driver;
  2. but the same time device-plugin pod starts, but this driver does not install completely. so pod in CrashLoopBackOff.

we can add a initContainers to check nvidia driver to resolve this problme.

lengrongfu avatar Jan 24 '24 07:01 lengrongfu