kepler
kepler copied to clipboard
cpu time becomes zero after long run
Describe the bug After running for extended hours, the curr_cpu_time becomes zero and never recovers
{
"__name__": "node_energy_stat",
"container": "kepler-exporter",
"cpu_architecture": "Cascade Lake",
"curr_cache_misses": "10019190",
"curr_cpu_cycles": "4005839184",
"curr_cpu_instructions": "5068047632",
"curr_cpu_time": "0.000000", <----------- this never recovers
"curr_energy_in_core": "154835.000000",
"curr_energy_in_dram": "5.000000",
"curr_energy_in_gpu": "0.000000",
"curr_energy_in_other": "0.000000",
"curr_resident_memory": "10457391104.000000",
"endpoint": "http",
"instance": "xxxxx",
"job": "kepler-exporter",
"namespace": "monitoring",
"node_name": "xxxx",
"pod": "kepler-exporter-9k75l",
"service": "kepler-exporter"
}
To Reproduce Steps to reproduce the behavior:
- Go to '...'
- Click on '....'
- Scroll down to '....'
- See error
Expected behavior A clear and concise description of what you expected to happen.
Screenshots If applicable, add screenshots to help explain your problem.
Desktop (please complete the following information):
- OS: [e.g. iOS]
- Browser [e.g. chrome, safari]
- Version [e.g. 22]
Smartphone (please complete the following information):
- Device: [e.g. iPhone6]
- OS: [e.g. iOS8.1]
- Browser [e.g. stock browser, safari]
- Version [e.g. 22]
Additional context Add any other context about the problem here.
cpu time is calculated here cc @marceloamaral
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
@rootfs do we still have this problem?
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.