graylog2-server icon indicating copy to clipboard operation
graylog2-server copied to clipboard

Datanode mem usage metrics incorrect

Open mpfz0r opened this issue 1 year ago • 2 comments

The memory usage metrics on my datanode don't match reality. They seem to ramp up for an hour and then drop back to where they started. In reality the memory usage stays mostly constant and has never reached more than 18% of the systems memory.

image

I think the rollup which happens every hour might cause the wrong calculation.

Your Environment

  • Graylog Version: 6.0.0-rc2

mpfz0r avatar Apr 15 '24 20:04 mpfz0r

This metric is showing the heap used/heap commited percentage. The dips are from where the gc kicks in. So, the graph is correct, but maybe not the most useful one for users. You could deduct if there is a memory leak if the baseline is rising over time. Do you have a suggestion for a better metric to monitor?

moesterheld avatar May 03 '24 09:05 moesterheld

The dips are from where the gc kicks in

are you sure about that? The GC certainly runs not only hourly.

Or are you saying this is the usage percentage of the process internal pool?

mpfz0r avatar May 22 '24 15:05 mpfz0r