nvidia_gpu_prometheus_exporter
Deploying gpu-exporter on a non-GPU node fails with an error and crashes
Hi @mindprince, the GPU exporter works well on a GPU node, but it fails with an error and crashes when deployed on a non-GPU node. Of course this is understandable, because there is no GPU hardware or NVML on that node, but should we still let the gpu-exporter run and simply not report the GPU metrics? That would match how cAdvisor behaves.
I do need this behavior because I combine gpu-exporter and the common node-exporter in one pod (as a DaemonSet), which runs on every node (including nodes without a GPU). Only this way can I join the common node metrics with the GPU node metrics.
Should we fix this? Any suggestions?
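To make the request concrete, here is a rough sketch of the fallback I have in mind. It is not the exporter's actual code: `gpuReader`, `initGPU`, `fixedCount`, the metric names, and the listen port are all assumptions made up for illustration, and the NVML call is stubbed out so the example only needs the standard library. The idea is that a failed NVML init is logged instead of being fatal, and `/metrics` keeps answering without GPU values.

```go
package main

import (
	"errors"
	"fmt"
	"log"
	"net/http"
)

// gpuReader is a hypothetical abstraction over NVML, not the exporter's real type.
type gpuReader interface {
	DeviceCount() (int, error)
}

// fixedCount is a stand-in reader that reports a constant device count.
type fixedCount int

func (c fixedCount) DeviceCount() (int, error) { return int(c), nil }

// errNoNVML stands in for the error an NVML binding returns on a non-GPU node.
var errNoNVML = errors.New("NVML library not found")

// initGPU is a placeholder for NVML initialization; here it always fails,
// which simulates a node without a GPU.
func initGPU() (gpuReader, error) { return nil, errNoNVML }

// metricsBody renders the exposition text. Without a usable reader it
// reports only an availability gauge set to 0 and no GPU metrics.
func metricsBody(r gpuReader) string {
	if r == nil {
		return "gpu_exporter_nvml_available 0\n"
	}
	n, err := r.DeviceCount()
	if err != nil {
		return "gpu_exporter_nvml_available 0\n"
	}
	return fmt.Sprintf("gpu_exporter_nvml_available 1\ngpu_exporter_num_devices %d\n", n)
}

func main() {
	reader, err := initGPU()
	if err != nil {
		// Log and keep running instead of exiting.
		log.Printf("NVML unavailable, serving without GPU metrics: %v", err)
		reader = nil
	}
	http.HandleFunc("/metrics", func(w http.ResponseWriter, _ *http.Request) {
		fmt.Fprint(w, metricsBody(reader))
	})
	// The port is an assumption for this sketch.
	log.Fatal(http.ListenAndServe(":9445", nil))
}
```

With something like this, the same DaemonSet pod could run on every node, and dashboards could filter on the availability gauge to tell GPU nodes from non-GPU nodes.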
Best Regards
I also have the same issue.