temboard icon indicating copy to clipboard operation
temboard copied to clipboard

Remove UNDEF "Streaming replication connection" Status After Role Change to Primary

Open bsislow opened this issue 2 years ago • 8 comments

How do we remove the check for "Streaming replication connection" after a cluster has been switched over into a primary?

It shows as the following whether enabled or disabled: image

We also removed replication_connection,replication_lag from temboard-agent.conf and restarted the agent, but the status check still shows up.

Thanks.

bsislow avatar Dec 14 '21 15:12 bsislow

Hi @bsislow . This is by design. If a check has ever been defined, it is shown as undef once it's disabled. We can't distinguish a check with no data because it is now useless and a check that has no data because of an error or a misconfiguration. We need to review the design for this.

bersace avatar Mar 03 '22 10:03 bersace

Thanks, it's just a bit misleading as UNDEF shows up immediately after CRIT and WARN on the overall dashboard and looks suspect even though we can ignore it apparently...

image

bsislow avatar Mar 03 '22 14:03 bsislow

Thanks, that's a good feedback on UX. We'll see what to do on this.

bersace avatar Mar 03 '22 16:03 bersace

Hi @bersace , what unit for threshold streaming replication lag base on?

wahyubudiman avatar Mar 13 '22 14:03 wahyubudiman

@wahyubudiman sorry, I don't understand your question, can you elaborate ?

bersace avatar Mar 14 '22 11:03 bersace

hi @bersace , about the streaming replication lag threshold ,showing 2e+06 for warning threshold, i want to know 2e+06 in byte/minute unit or other unit size? below the screenshot : image

wahyubudiman avatar Mar 16 '22 06:03 wahyubudiman

hi @bersace , about the streaming replication lag threshold ,showing 2e+06 for warning threshold, i want to know 2e+06 in byte/minute unit or other unit size? below the screenshot : image

It's in bytes. temBoard should show the units of this metrics. Marking as bug.

bersace avatar Apr 08 '22 09:04 bersace

hi @bersace , about the streaming replication lag threshold ,showing 2e+06 for warning threshold, i want to know 2e+06 in byte/minute unit or other unit size? below the screenshot : image

Could you please open a new issue about unit in monitoring/alerting ?

bersace avatar Apr 08 '22 09:04 bersace

Fixed in 8.1

bersace avatar Oct 09 '23 12:10 bersace