dcgm-exporter icon indicating copy to clipboard operation
dcgm-exporter copied to clipboard

How to get current device MIG model is single or mixed?

Open lengrongfu opened this issue 11 months ago • 2 comments

I have an A800 device, when I open the MIG model, we can't from dcgm-exporter get a metric know the current device MIG model is single or mixed.

lengrongfu avatar Mar 19 '24 07:03 lengrongfu

Please answer the following questions to get better assistance:

What happened?

Tell us what happened and provide as many details as possible, including logs.

What did you expect to happen?

Tell us about expected behaviour.

What is the GPU model?

Tell us about the hardware configuration of the GPU, including the output of 'nvidia-smi'

What is the environment?

Is DCGM-Exporter running on bare metal or in a virtual environment, container, pod, etc?

How did you deploy the dcgm-exporter and what is the configuration?

Tell us how you deployed DCGM-Exporter. Did you use helm, build from source or use the GPU Operator?

How can we reproduce the issue?

Clear and concise steps to reproduce an issue can help everyone by allowing us to identify and fix problems more quickly.

What is the version?

Tell us about DCGM-exporter version.

nvvfedorov avatar Mar 19 '24 14:03 nvvfedorov

@lengrongfu, The GPU feature discovery (https://github.com/NVIDIA/gpu-feature-discover) offers the "nvidia.com/mig.strategy" node labels. Do you want to see this label as part of the metric output?

Can you tell us your use case?

nvvfedorov avatar Apr 02 '24 15:04 nvvfedorov