[trimaran] Cpu metric not found in node metrics

Open namgizlat opened this issue 1 year ago • 6 comments

Area

  • [X] Scheduler
  • [ ] Controller
  • [ ] Helm Chart
  • [ ] Documents

Other components

No response

What happened?

While using the Trimaran plugin, the following error appears in the scheduler log:

E0212 06:47:01.656601 1 targetloadpacking.go:142] "Cpu metric not found in node metrics" nodeName="node01.namgiz.com" nodeMetrics=[{Name:instance:node_memory_utilisation:ratio Type:Memory Operator:AVG Rollup:15m Value:11.70095792210995} {Name:instance:node_network_transmit_bytes:rate:sum Type:Bandwidth Operator:AVG Rollup:15m Value:8.235324565656565e+06} {Name:instance:node_network_receive_bytes:rate:sum Type:Bandwidth Operator:AVG Rollup:15m Value:1.6781439838383835e+07} {Name:instance:node_memory_utilisation:ratio Type:Memory Operator:STD Rollup:15m Value:0.03558027903707252} {Name:instance:node_network_transmit_bytes:rate:sum Type:Bandwidth Operator:STD Rollup:15m Value:502076.00473060855} {Name:instance:node_network_receive_bytes:rate:sum Type:Bandwidth Operator:STD Rollup:15m Value:3.155026755193178e+06}]

The reported metrics include only Memory and Bandwidth entries; none has a CPU type.
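To make the failure mode concrete, here is a minimal sketch of a type-based lookup over the metrics from the log. It is not the plugin's source: the `Metric` struct only mirrors the field names printed in the log, and `findByType` and the `"CPU"` type label are hypothetical names chosen for illustration. Under those assumptions, a lookup for a CPU-typed entry finds nothing in this list, which is the condition the error message describes.

```go
package main

import "fmt"

// Metric mirrors the fields printed in the log line.
// Hypothetical stand-in, not the plugin's actual type.
type Metric struct {
	Name     string
	Type     string
	Operator string
	Rollup   string
	Value    float64
}

// cpuType is an assumed label for CPU metrics; the log itself
// only shows the "Memory" and "Bandwidth" types.
const cpuType = "CPU"

// findByType returns the first metric whose Type matches,
// and reports whether such a metric exists.
func findByType(metrics []Metric, metricType string) (Metric, bool) {
	for _, m := range metrics {
		if m.Type == metricType {
			return m, true
		}
	}
	return Metric{}, false
}

func main() {
	// The AVG entries copied from the reported log line.
	nodeMetrics := []Metric{
		{Name: "instance:node_memory_utilisation:ratio", Type: "Memory", Operator: "AVG", Rollup: "15m", Value: 11.70095792210995},
		{Name: "instance:node_network_transmit_bytes:rate:sum", Type: "Bandwidth", Operator: "AVG", Rollup: "15m", Value: 8.235324565656565e+06},
		{Name: "instance:node_network_receive_bytes:rate:sum", Type: "Bandwidth", Operator: "AVG", Rollup: "15m", Value: 1.6781439838383835e+07},
	}

	if _, ok := findByType(nodeMetrics, cpuType); !ok {
		fmt.Println("Cpu metric not found in node metrics")
	}
}
```

If the metrics provider returned a CPU-typed entry as well, the same lookup would succeed, so the gap appears to be upstream of the scheduler in whatever produces the node metrics.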

What did you expect to happen?

Trimaran should use the CPU metric when scheduling pods.

How can we reproduce it (as minimally and precisely as possible)?

No response

Anything else we need to know?

Running on OpenShift (version below).

Kubernetes version

OpenShift Version: 4.12.40
Kubernetes Version: v1.25.14+20cda61

Scheduler Plugins version

registry.k8s.io/scheduler-plugins/kube-scheduler:v0.27.8

namgizlat avatar Feb 12 '24 06:02 namgizlat

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue as fresh with /remove-lifecycle stale
  • Close this issue with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

k8s-triage-robot avatar May 12 '24 07:05 k8s-triage-robot

@Huang-Wei this issue is due to some metric name changes specific to OpenShift. We have made some changes in load watcher so that it supports both OpenShift and Kubernetes. Should we push an update on the versions of load watcher for Trimaran?

wangchen615 avatar May 30 '24 15:05 wangchen615

should we push an update on the versions of load watcher for Trimaran?

Yes, please. And come up with a release note detailing which metrics are supported on both.

Huang-Wei avatar Jun 01 '24 00:06 Huang-Wei

/lifecycle rotten

k8s-triage-robot avatar Jul 01 '24 01:07 k8s-triage-robot

@Huang-Wei this issue is due to some metric name changes specific to OpenShift. We have made some changes in load watcher so that it supports both OpenShift and Kubernetes. Should we push an update on the versions of load watcher for Trimaran?

Hi,

was it done?

namgizlat avatar Jul 29 '24 11:07 namgizlat

cc @wangchen615

/remove-lifecycle rotten

Huang-Wei avatar Jul 29 '24 18:07 Huang-Wei

/lifecycle stale

k8s-triage-robot avatar Oct 27 '24 18:10 k8s-triage-robot

/lifecycle rotten

k8s-triage-robot avatar Nov 26 '24 18:11 k8s-triage-robot

/close not-planned

k8s-triage-robot avatar Dec 26 '24 19:12 k8s-triage-robot

@k8s-triage-robot: Closing this issue, marking it as "Not Planned".

k8s-ci-robot avatar Dec 26 '24 19:12 k8s-ci-robot