ansible-role-nvidia-driver icon indicating copy to clipboard operation
ansible-role-nvidia-driver copied to clipboard

Problem due to package naming?

Open sanderant opened this issue 5 years ago • 2 comments

sanderant avatar Aug 31 '19 16:08 sanderant

It seems like the rpm package naming may have changed, or some problem with the kernel not being up to date with the latest driver version. This worked correctly with the 418.67 version. Below is the error I'm getting at the task level.

TASK [ansible-role-nvidia-driver : update kernel if headers don't match] 

*****************************************************************************************************************************************************************
fatal: [xxx]: FAILED! => {"changed": true, "msg": "Error: Could not find suitable Nvidia kernel module version for kernel kernel-3.10.0-957.el7.x86_64 and driver 3:nvidia-driver-latest-418.87.00-2.el7.x86_64\n", "rc": 1, "results": ["All packages providing kernel-debug-devel are up to date", "All packages providing kernel-headers are up to date", "Loaded plugins: fastestmirror, nvidia\n#### NVIDIA ####\nLoading mirror speeds from cached hostfile\n * base: mirror.cc.columbia.edu\n * extras: mirror.cc.columbia.edu\n * updates: mirror.cc.columbia.edu\nPackage kernel-debug-devel-3.10.0-957.27.2.el7.x86_64 already installed and latest version\nPackage kernel-headers-3.10.0-957.27.2.el7.x86_64 already installed and latest version\nResolving Dependencies\n--> Running transaction check\n---> Package kernel.x86_64 0:3.10.0-957.27.2.el7 will be installed\n---> Package kernel-devel.x86_64 0:3.10.0-957.27.2.el7 will be installed\n---> Package kernel-tools.x86_64 0:3.10.0-957.el7 will be updated\n---> Package kernel-tools.x86_64 0:3.10.0-957.27.2.el7 will be an update\n---> Package kernel-tools-libs.x86_64 0:3.10.0-957.el7 will be updated\n---> Package kernel-tools-libs.x86_64 0:3.10.0-957.27.2.el7 will be an update\n--> Finished Dependency Resolution\n"]}

However running the command below gets me the driver I need (at least on one of the hosts) yum install nvidia-driver-latest-dkms

So did the correct package name to the entry above, or is this doing something different?

sanderant avatar Aug 31 '19 16:08 sanderant

I am seeing the same error on my system. Running yum install nvidia-driver-latest-dkms seems to work just fine though.

sambristow-syrp avatar Sep 09 '19 21:09 sambristow-syrp