cuda install with kernel-rt for azl 3.0

Open ankithmr opened this issue 1 year ago • 1 comments

I am using Azure Linux 3.0 with the RT kernel (for example, 6.6.35.1-rt34-1.azl3). However, the CUDA install pulls in a different kernel, and the nvidia-smi command only works when booted into that kernel.

Error with RT kernel:

afoedge@insedge-68 [ ~ ]$ nvidia-smi
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

If I boot the OS with the RT kernel, nvidia-smi fails. Do you have a CUDA RPM that corresponds to the RT kernel?
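One way to narrow this down is to check whether any NVIDIA kernel module is installed for the kernel that is currently booted. The sketch below is a diagnostic only, under the assumption (not confirmed in this thread) that the driver package ships modules built for the non-RT kernel alone. The helper name `has_nvidia_module` and the default `/lib/modules` root are illustrative, not part of any Azure Linux tooling.

```shell
#!/bin/sh
# Hypothetical helper: succeed if an nvidia*.ko* file exists under the
# module tree of the given kernel release.
# Usage: has_nvidia_module KERNEL_RELEASE [MODULES_ROOT]
has_nvidia_module() {
    release="$1"
    root="${2:-/lib/modules}"   # assumed default module location
    # No module directory for this release means no driver for it.
    [ -d "$root/$release" ] || return 1
    # Modules may be compressed (.ko.xz, .ko.zst), so match any suffix.
    found=$(find "$root/$release" -name 'nvidia*.ko*' -print 2>/dev/null | head -n 1)
    [ -n "$found" ]
}

# Report for the kernel that is running right now.
running=$(uname -r)
if has_nvidia_module "$running"; then
    echo "nvidia module present for $running"
else
    echo "no nvidia module found for $running"
fi
```

If the RT kernel reports no module while the kernel installed alongside CUDA does, that would be consistent with the driver having been built only for the non-RT kernel.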

I also tried the CM2 version, but the nvidia-open driver fails to install:

https://github.com/microsoft/azurelinux/blob/3.0/toolkit/docs/nvidia/nvidia.md

root [ /opt/kfo ]# sudo tdnf -y install nvidia-open
Loaded plugin: tdnfrepogpgcheck
1. package nvidia-open-560.35.03-1.noarch requires nvidia-driver-cuda >= 560.35.03, but none of the providers can be installed
Found 1 problem(s) while resolving
Error(1301) : Solv general runtime error
root [ /opt/kfo ]#

ankithmr avatar Oct 18 '24 08:10 ankithmr

Can you please provide an update on this?

ankithmr avatar Oct 25 '24 10:10 ankithmr

Hello, can you please provide an update on this? We are still having issues getting nvidia-smi to work with the AZL3 RT kernel.

ankithmr avatar Jan 06 '25 11:01 ankithmr

Any updates?

ankithmr avatar Jan 28 '25 16:01 ankithmr

Hello, is there any progress on this? I would like to use nvidia-smi with the RT kernel!

ankithmr avatar Mar 26 '25 19:03 ankithmr

Any update on this?

ankithmr avatar Jun 30 '25 13:06 ankithmr