Maxime Schmitt
Maxime Schmitt
Hello @johnnynunez, I looked at the documentation and it seems to me that the tensor cores are treated as part of the compute resources. I cannot find any function to...
@HTOgit The support for the [pcie_bw file](https://dri.freedesktop.org/docs/drm/gpu/amdgpu.html#pcie-bw) is in place. However this file is empty on the system I am testing (RX 6800XT, Kernel 5.19.6). The only explanation that I...
Hello, Nvtop uses the dynamic linker to load the libraries. If the libraries are not in the linker search path you can add additional paths using LD_LIBRARY_PATH (`man ld.so`), for...
Hello @jjk334, Could you please indicate which version of nvtop you are using? You can use `nvtop --version` to get that information. If you are not using v3.0.0, could you...
Hello, I don't know how Slurm allocates the GPUs, could you check if the library `libnvidia-ml.so` is available? That's the library used to get the GPU information, `nvidia-smi` directly queries...
I guess that it's hard for a process to know if it's running in a VM/container. Maybe add a column showing the `hostname` of the machine which is probably the...
@Umio-Yasuno, thanks for pointing that out. By looking at the [kernel source code of amdgpu discovery](https://elixir.bootlin.com/linux/latest/source/drivers/gpu/drm/amd/amdgpu/amdgpu_discovery.c#L1641) it seems that I cannot rely solely on the family being >= of [AMDGPU_FAMILY_NV)](https://elixir.bootlin.com/linux/latest/source/include/uapi/drm/amdgpu_drm.h#L1261)...
I just pushed the [patch](https://github.com/Syllo/nvtop/commit/fb9ef11db110928eb0ff377050efb5b4980e59f6) to master (tested on VCE version 1 though) but it should now detect shared Enc+Dec on version >= 4.
Hello, That does not seem too difficult to implement. I guess that we can use the theoretical limit for PCIe bandwidth as reference for the percentage calculation.
Hello, I think that I might have to update the Nvidia backend to support MIG device handles. Although there is a notice in the documentation: "In MIG mode, if device...