Default override preventing heartbeat failure alert.

We had a server that went down and didn’t generate alerts for Heartbeat Failure or Could not Connect to Computer.

What I’d found is that there is a group in SCOM called
“Managed Computer Client Health Service Watcher Group” and there is a default
override to disable  generating alerts for Heartbeat Failure or Could not
Connect to Computer against this group.

This group is apparently intended for workstations being monitored by SCOM and is dynamically populated but sometimes servers also ended up in there.

I you don’t monitor workstations the easiest solution is to create a second override to enable those alerts and just enforce it.

Loading

2 thoughts on “Default override preventing heartbeat failure alert.

    1. Warren Kahn Post author

      Heartbeat Failure servers as a trigger for Computer Not Reachable so I don’t think you shoudl disable it. Perhaps just lower the alert severity or alternativly create a new monitor to replace Computer Not Reachable. you could even just use a ping alert from the xping pack as an example.

      Reply

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.