bstollenvidia
I confirmed this is still present in 3.1.6: https://github.com/NVIDIA/DCGM/blob/4aedfaae1f7c8480e46b8c835ddd5afbd00d57be/testing/python3/DcgmReader.py#L400
A couple of debugging steps. Can you share the output of the following commands?
1. dcgmi --version
2. dcgmi discovery -l
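If it helps, here is a minimal sketch for gathering the output of the two commands requested above into a single block of text you can paste into a reply. The helper name `collect_output` and the report layout are my own choices for illustration, not part of DCGM; the script only shells out to `dcgmi` and notes when the binary is not on the PATH.

```python
import shutil
import subprocess

# The two commands requested above.
COMMANDS = [
    ["dcgmi", "--version"],
    ["dcgmi", "discovery", "-l"],
]


def collect_output(commands=COMMANDS):
    """Run each command and return one text report suitable for pasting.

    collect_output is a hypothetical helper for this thread, not a DCGM API.
    """
    sections = []
    for cmd in commands:
        header = "$ " + " ".join(cmd)
        # Report a missing binary instead of raising, so the report is still useful.
        if shutil.which(cmd[0]) is None:
            sections.append(f"{header}\n[{cmd[0]} not found on PATH]")
            continue
        result = subprocess.run(cmd, capture_output=True, text=True)
        body = (result.stdout + result.stderr).strip()
        sections.append(f"{header}\n{body}\n[exit code {result.returncode}]")
    return "\n\n".join(sections)


if __name__ == "__main__":
    print(collect_output())
```

Running the script on the affected machine and pasting everything it prints should cover both items.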
All DCGM packages and docs can be obtained here: https://developer.nvidia.com/data-center-gpu-manager-dcgm
NVVS (soon to be called DCGM GPU Diagnostic) is part of the DCGM package and can be obtained here: https://developer.nvidia.com/data-center-gpu-manager-dcgm
Note that the link to the NVVS user guide is...
DCGM supports profiling metrics on all Volta and newer compute GPUs, including Ampere and Ada.