
kepler_node_dram_joules_total = 0 on RHEL9/arm64 system

Open: jharriga opened this issue 1 year ago • 1 comment

What happened?

I downloaded and installed

https://github.com/sustainable-computing-io/kepler/releases/download/v0.7.9/kepler.rpm.tar.gz

on a server running

  • 5.14.0-362.21.1.el9_3.aarch64
  • Red Hat Enterprise Linux 9.3 (Plow)

The metric kepler_node_dram_joules_total is reported as 0:

kepler_node_dram_joules_total{instance="perf-arm-11.perf.eng.bos2.dc.redhat.com",mode="dynamic",package="0",source="intel_rapl"} 0

I ran several memory-intensive workloads and the metric value remained at 0.

What did you expect to happen?

I expected the metric reading to increase and track system memory usage.

How can we reproduce it (as minimally and precisely as possible)?

  1. Download and install the rpm.
  2. Start the service: root# systemctl start container-kepler --now
  3. Query the metrics: root# curl localhost:8888/metrics | grep
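For anyone repeating this check, the metric query can also be scripted. The following is only a minimal sketch, assuming the exporter serves standard Prometheus text format at the `localhost:8888/metrics` address used in the curl command. The helper names `parse_samples` and `fetch_metrics` are made up for illustration and are not part of Kepler.

```python
import re
import urllib.request

# Assumption: same endpoint as the curl command in the reproduction steps.
METRICS_URL = "http://localhost:8888/metrics"

# One sample line: metric name, optional {labels}, then the value.
SAMPLE_RE = re.compile(
    r'^(?P<name>[a-zA-Z_:][a-zA-Z0-9_:]*)(?:\{(?P<labels>.*)\})?\s+(?P<value>\S+)'
)
LABEL_RE = re.compile(r'(\w+)="((?:[^"\\]|\\.)*)"')


def parse_samples(text, metric):
    """Return a list of (labels_dict, value) for one metric name."""
    samples = []
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and "# HELP" / "# TYPE" comment lines.
        if not line or line.startswith("#"):
            continue
        match = SAMPLE_RE.match(line)
        if match is None or match.group("name") != metric:
            continue
        labels = dict(LABEL_RE.findall(match.group("labels") or ""))
        samples.append((labels, float(match.group("value"))))
    return samples


def fetch_metrics(url=METRICS_URL):
    """Download the raw metrics text (hypothetical helper, needs a running exporter)."""
    with urllib.request.urlopen(url, timeout=5) as resp:
        return resp.read().decode("utf-8")
```

Calling `parse_samples(fetch_metrics(), "kepler_node_dram_joules_total")` before and after a memory-intensive workload and comparing the values would show whether the counter moves at all.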

Anything else we need to know?

No response

Kepler image tag

Kubernetes version

NONE

Cloud provider or bare metal

bare-metal

OS version

# On Linux:
$ cat /etc/os-release
Red Hat Enterprise Linux 9.3 (Plow)
$ uname -a
Linux perf-arm-11.perf.eng.bos2.dc.redhat.com 5.14.0-362.21.1.el9_3.aarch64 #1 SMP PREEMPT_DYNAMIC Thu Jan 25 08:27:11 EST 2024 aarch64 aarch64 aarch64 GNU/Linux

</details>


### Install tools

<details>
# rpm --version
RPM version 4.16.1.3

</details>


### Kepler deployment config

<details>

NONE

</details>


### Container runtime (CRI) and version (if applicable)

<details>

</details>


### Related plugins (CNI, CSI, ...) and versions (if applicable)

<details>

</details>

jharriga, Apr 11 '24 17:04

Currently, the Ampere X-Gene hwmon driver reports only CPU and I/O power (per doc here), so DRAM power is not available on this platform. To stay aligned with the RAPL reporting, kepler only reports kepler_node_core_total (per code here), and the DRAM metric stays at 0.

rootfs, May 22 '24 20:05
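To see which power sensors a given machine actually exposes, the generic Linux hwmon sysfs layout can be walked directly. This is a rough sketch, assuming the standard `/sys/class/hwmon/hwmon*/power*_input` and `power*_label` files (values in microwatts). The function name `list_hwmon_power_sensors` is hypothetical, and this is not how Kepler itself reads the sensors.

```python
from pathlib import Path


def list_hwmon_power_sensors(root="/sys/class/hwmon"):
    """Return (chip_name, label, microwatts) for every power*_input file found."""
    sensors = []
    for chip in sorted(Path(root).glob("hwmon*")):
        name_file = chip / "name"
        # Fall back to the directory name if the chip has no "name" file.
        chip_name = name_file.read_text().strip() if name_file.exists() else chip.name
        for inp in sorted(chip.glob("power*_input")):
            label_file = inp.with_name(inp.name.replace("_input", "_label"))
            # Fall back to the file name if the sensor has no label.
            label = label_file.read_text().strip() if label_file.exists() else inp.name
            try:
                microwatts = int(inp.read_text().strip())
            except (OSError, ValueError):
                # Sensor unreadable or not numeric: skip it.
                continue
            sensors.append((chip_name, label, microwatts))
    return sensors
```

If the comment above holds for this server, the returned list should contain CPU and I/O entries but nothing for DRAM.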