Failed to watch metrics: Error watching fields: The third-party Profiling module returned an u
What is the version?
3.3.5-3.4.1
What happened?
dcgm-exporter may encounter errors during the first startup, but the issue can be resolved by automatically restarting the service. The error message is as follows:
What did you expect to happen?
rt
What is the GPU model?
No response
What is the environment?
No response
How did you deploy the dcgm-exporter and what is the configuration?
No response
How to reproduce the issue?
No response
Anything else we need to know?
No response
There might be a race condition causing this behavior. Please attach the /var/log/nv-hostengine.log file from the container or enable DCGM logging with the following parameters:
--enable-dcgm-log --dcgm-log-level DEBUG
There might be a race condition causing this behavior. Please attach the /var/log/nv-hostengine.log file from the container or enable DCGM logging with the following parameters:
--enable-dcgm-log --dcgm-log-level DEBUG
"Okay, I'll give it a try."