kepler Kepler is very slow to start

Describe the bug Before the PR #225, Kepler was taking 1.18ms to start. But now, it is taking 40.17s to start

Although It might be related to the model server, the default deployment should have the model server disabled...

To Reproduce Steps to reproduce the behavior: The PR #307 introduce a log with the start time elapsed time.

Expected behavior Kepler should start faster, specially if the model server is disabled....

Oct 18 '22 04:10 marceloamaral

/cc @sunya-ch

Oct 18 '22 04:10 marceloamaral

The current implementation there is no flag to omit getting initial weight by connecting model-server or loading model weight. We can just add that flag in the config and use it as a condition to skip this below function. https://github.com/sustainable-computing-io/kepler/blob/605dc9cf79d1e7e600ef6e0a468d964b73be6a72/pkg/collector/metrics.go#L108

Oct 18 '22 05:10 sunya-ch

I tried to submit a PR but comment ths line but got following (can check it and update somewhere) ,so what's the impact of remove such function? e.g estimate function not work at all?

panic: runtime error: index out of range [0] with length 0

goroutine 66 [running]:
github.com/sustainable-computing-io/kepler/pkg/collector.(*Collector).reader.func1()
        /opt/app-root/src/github.com/sustainable-computing-io/kepler/pkg/collector/reader.go:310 +0x113e
created by github.com/sustainable-computing-io/kepler/pkg/collector.(*Collector).reader
        /opt/app-root/src/github.com/sustainable-computing-io/kepler/pkg/collector/reader.go:256 +0x85

Oct 20 '22 06:10 jichenjc

The impact is that there will be no initial model server to be downloaded and no estimate model will be applied. If no both node/RAPL power measured, it could cause that error. I will push another PR to fix that case by at least returns zeros.

Oct 20 '22 08:10 sunya-ch

The impact is that there will be no initial model server to be downloaded and no estimate model will be applied.

so for education purpose, no such model doesn't impact main metrics collect function so they can start kepler faster?

Oct 20 '22 08:10 jichenjc

Yes. I should not affect the main metric collect function.

Oct 20 '22 09:10 sunya-ch

Closing because PR #316

Oct 21 '22 02:10 marceloamaral

kepler kepler copied to clipboard

Kepler is very slow to start

kepler
kepler copied to clipboard