Update NVIDIA drivers up to version 550.x
Hello.
The current NVIDIA drivers in Talos are quite outdated. Version 535.129.03 was released six months ago. It ships with CUDA 12.2, which is a problem for some newer software, such as ffmpeg v7.
Is there any ETA for upgrading it to the latest 550.x branch?
There is no ETA at the moment, but the update will happen at some point before the next Talos release.
Also, updating to the latest NVIDIA drivers seems to break things for enterprise users, so we are cautious about updating them.
I think with this update we should also support older drivers (need to figure out extension names vs. Talos upgrades), but basically provide both 535 and 550 for Talos 1.8.
Yes, we should figure this out
Theoretically, could we add an option to choose the extension version in factory.talos.dev?
No, but you can use the imager if you want your own custom image. And the NVIDIA extension is locked to the kernel build anyway, so it won't work with a different version of Talos.
If you have any fresh build, I can test it... I would love to have an updated version. I've tried to play with the imager and some modules from here, taking the same kernel version from the nightlies, but it didn't boot up :(
I have a workflow to build the kernel and the Nvidia open kernel modules here, if you need inspiration. I'm running 6.10.3 with NVIDIA 550.107.02 on my nodes.
Starting with Talos 1.8, both the LTS and Production NVIDIA driver versions will be shipped, as per the support matrix: https://www.talos.dev/v1.8/introduction/support-matrix/