nvidia_gpu_prometheus_exporter icon indicating copy to clipboard operation
nvidia_gpu_prometheus_exporter copied to clipboard

deploy gpu-exporter on a non-gpu node will get error and crash

Open Cherishty opened this issue 6 years ago • 1 comments

Hi @mindprince , the gpu exporter works pretty well on a gpu node, while it will get error when deployed on a non-gpu node. Of curse it is reasonable because there is no hardware and NVML on that node, but should us still enable the gpu-exporter, just does NOT display related metrics any more? So that it can respect the behavior as cadvisor

Actually I do need this behavior because I combine gpu-exporter can common node-export in one pod (as daemonset), which will run on each node (even for the node without GPU), and only in this way can I join the common-node-metrics with gpu-node-metrics together

Should we fix it? or any suggestion?

Best Regards

Cherishty avatar Dec 04 '18 08:12 Cherishty

i also have the same issue.

ghost avatar Dec 10 '18 12:12 ghost