glowkey
These metrics should be supported by all GPU models but are not supported for MIG configurations. Do you see the metrics with 'dcgmi dmon'?
Use DCGM_FI_PROF_GR_ENGINE_ACTIVE and DCGM_FI_PROF_DRAM_ACTIVE, which are reported for MIG devices. I'd encourage you to look through the DCGM_FI_PROF* family of metrics, otherwise known as DCP metrics.
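If it helps to check which DCP metrics the exporter is actually emitting, here is a rough Python sketch that pulls the DCGM_FI_PROF_* names out of Prometheus-style text output. The function name `dcp_metric_names` and the sample text are made up for illustration and are not part of DCGM or dcgm-exporter. It assumes each sample line looks like `NAME{labels} VALUE`.

```python
import re

# Matches one text-format sample line: metric name, optional {labels}, value.
SAMPLE_RE = re.compile(r'^([A-Za-z_:][A-Za-z0-9_:]*)(\{.*\})?\s+(\S+)')


def dcp_metric_names(exposition_text):
    """Return the sorted, de-duplicated DCGM_FI_PROF_* metric names found."""
    names = set()
    for line in exposition_text.splitlines():
        line = line.strip()
        # Skip blank lines and "# HELP" / "# TYPE" comment lines.
        if not line or line.startswith("#"):
            continue
        match = SAMPLE_RE.match(line)
        if match and match.group(1).startswith("DCGM_FI_PROF_"):
            names.add(match.group(1))
    return sorted(names)


# Hypothetical sample, invented for illustration only (not real exporter output).
SAMPLE = """\
# HELP DCGM_FI_PROF_GR_ENGINE_ACTIVE hypothetical help text
# TYPE DCGM_FI_PROF_GR_ENGINE_ACTIVE gauge
DCGM_FI_PROF_GR_ENGINE_ACTIVE{gpu="0"} 0.5
DCGM_FI_PROF_DRAM_ACTIVE{gpu="0"} 0.25
DCGM_FI_DEV_GPU_TEMP{gpu="0"} 40
"""

if __name__ == "__main__":
    for name in dcp_metric_names(SAMPLE):
        print(name)
```

Paste the exporter output in place of `SAMPLE`. An empty result would suggest the DCP fields are not being collected at all.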
Which metrics are you monitoring? Can you attach the output of the exporter? Is the libnvidia-nscq library installed?
Note that NVSwitches and NVLinks may not automatically be mounted inside the container. See https://github.com/NVIDIA/dcgm-exporter/issues/316#issuecomment-2087369233
Yes. Can it be adjusted to honor the user-specified log level?
Oh, and thank you for the submission!
In my testing, this change was incompatible with the Capture mechanism, and I wasn't seeing any output in either format. When I disabled Capture, I was able to test...