zabbix-nvidia-smi-multi-gpu icon indicating copy to clipboard operation
zabbix-nvidia-smi-multi-gpu copied to clipboard

Cannot collect info of GPU resource on VM

Open ryupim opened this issue 2 years ago • 3 comments

I want to monitor GPU resource on VM using zabbix. I followed the instructions in the README, but the VM outputs the error shown below. On bare metal, it worked. If you know a solution, please let me know.

■ Environment Host OS: vSphere ESXi 7.0U3 GPU: A40 Guest OS: Windows 10 Pro GPU profile (Guest OS): NVIDIA GRID vGPU nvidia_a40-8q GPU driver (Guest OS & Host OS): 510.47.03

スクリーンショット 2022-09-26 13 14 39

ryupim avatar Sep 26 '22 04:09 ryupim

Hi,

At first glance, it looks like the .bat file does not work correctly.

Did you make it work?

plambe avatar Oct 03 '22 12:10 plambe

Thank you for reply. Because PATH of VM wasn't included nvidia-smi, I added it. But, Zabbix Server shows below error. "Invalid discovery rule value: cannot parse as a valid JSON object: invalid JSON object value starting character at: ''nvidia-smi.exe' is not recognized as an internal or external command, operable program or batch file."

I'll continue to look into it.

ryupim avatar Oct 11 '22 05:10 ryupim

Have you been able to run the nvidia-smi command manually on it's own without zabbix or any bat scripts being involved?

It might be that you could use a fully qualified path to refer to the nvidia-smi command.

RichardKav avatar Oct 17 '22 20:10 RichardKav