
failed monitors alert when dependent monitor succeeds after failure

Open mikbro2021 opened this issue 4 years ago • 1 comments

If a device (#1) fails, and then a device (#2) that #1 depends on fails and recovers, we get a second alert for the dependent device (#1).

To explain more fully, an AP is hanging off a switch. The AP monitor 'depends' on the switch.

The AP fails and we correctly get an alert. The switch then fails and we correctly get an alert for the switch. The switch recovers and we incorrectly get an alert for the AP. The system should 'remember' that the AP was down before the switch went down/up.

mikbro2021 avatar Jul 29 '21 15:07 mikbro2021

Ah yeah I think I see why this is happening... now I need to figure out the best way to fix it :)

(I think it's because "skipped" monitors, those with failed dependencies, internally pretend to have succeeded, so when the switch fails, the AP monitor is marked as succeeded, which means when the switch is up again, the AP monitor failure is seen as new.)

jamesoff avatar Jul 29 '21 16:07 jamesoff
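
The behaviour described in this thread can be reproduced with a small toy model. This is only a sketch and not simplemonitor's actual code: `ToyMonitor`, `run_cycle`, `simulate`, and the `remember_skipped` flag are all hypothetical names invented for illustration. It assumes a skipped monitor either has its failure state reset (the suspected cause) or keeps its previous state (one possible fix, not necessarily the one the maintainer would choose).

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class ToyMonitor:
    """Hypothetical stand-in for a monitor; not a simplemonitor class."""
    name: str
    depends_on: Optional[str] = None
    failed: bool = False  # state recorded after the most recent run


def run_cycle(monitors: List[ToyMonitor], is_up: Dict[str, bool],
              remember_skipped: bool) -> List[str]:
    """Run one cycle and return the names that raise a *new* failure alert.

    `monitors` must list dependencies before the monitors that depend on them.
    """
    by_name = {m.name: m for m in monitors}
    alerts = []
    for m in monitors:
        if m.depends_on and by_name[m.depends_on].failed:
            # Dependency is down, so this monitor is skipped.
            if not remember_skipped:
                # Suspected cause: a skipped monitor pretends it succeeded.
                m.failed = False
            continue
        up = is_up[m.name]
        if not up and not m.failed:
            alerts.append(m.name)  # failure looks new, so alert
        m.failed = not up
    return alerts


def simulate(remember_skipped: bool) -> List[List[str]]:
    """Replay the AP/switch scenario from the issue, one list of alerts per cycle."""
    monitors = [ToyMonitor("switch"), ToyMonitor("ap", depends_on="switch")]
    timeline = [
        {"switch": True, "ap": False},   # AP fails
        {"switch": False, "ap": False},  # switch fails too, AP is skipped
        {"switch": True, "ap": False},   # switch recovers, AP still down
    ]
    return [run_cycle(monitors, state, remember_skipped) for state in timeline]


if __name__ == "__main__":
    print("forgetting skipped state:", simulate(remember_skipped=False))
    print("remembering skipped state:", simulate(remember_skipped=True))
```

With `remember_skipped=False` the third cycle alerts for the AP again, matching the report; with `remember_skipped=True` the AP's earlier failure is preserved across the skip, so no duplicate alert is raised.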