
Actionable Alerts from the experiences of the community here

Open danilansible opened this issue 2 years ago • 2 comments

Hi Guys,

I'm trying to come up with an actionable list of alerts. Inaccessible VMs and vSAN percentUsed are some of the ones that came to mind. I'd like to check with the community here for suggestions for a few more, and any feedback would be valuable.

@rschitz, @vmdude and the rest of the team: kudos again for the new version and all the improvements/features.

Thank you

danilansible avatar Jan 17 '23 13:01 danilansible

Hi @danilansible, and thanks for your support. You'd have to create a custom dashboard with all of those and create an alarm for each. You could also check the new "unified" Grafana alerting system to create standalone alerts without the need for an attached graph, but the alarm would only be text: https://grafana.com/docs/grafana/v8.5/alerting/unified-alerting/
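
For the dashboard route, here is a minimal sketch of generating one graph panel per target, each with an alarm attached. It assumes the legacy (pre-unified) Grafana panel alert JSON layout as commonly seen in exported dashboards. The helper names, panel titles and thresholds are hypothetical, so compare the output against a dashboard exported from your own instance before importing anything.

```python
import json


def alert_panel(panel_id, title, target, threshold):
    """Build one graph panel with a single Graphite query and a legacy alert rule."""
    return {
        "id": panel_id,
        "type": "graph",  # legacy alerts attach to graph panels
        "title": title,
        "targets": [{"refId": "A", "target": target}],
        "alert": {
            "name": f"{title} alert",
            "frequency": "1m",  # how often the rule is evaluated
            "for": "5m",        # how long the condition must hold before firing
            "conditions": [{
                "type": "query",
                "query": {"params": ["A", "5m", "now"]},
                "reducer": {"type": "max", "params": []},
                "evaluator": {"type": "gt", "params": [threshold]},
                "operator": {"type": "and"},
            }],
            "noDataState": "no_data",
            "executionErrorState": "alerting",
            "notifications": [],
        },
    }


def build_dashboard(title, rules):
    """rules is a list of (panel title, Graphite target, threshold) tuples."""
    panels = [
        alert_panel(index + 1, name, target, threshold)
        for index, (name, target, threshold) in enumerate(rules)
    ]
    return {"title": title, "panels": panels}


if __name__ == "__main__":
    # Thresholds below are placeholders, not recommendations.
    rules = [
        ("vSAN disk used", "vsan.*.*.*.esx.*.vsan.disk.capacity.*.percentUsed", 80),
        ("Dead VMs", "vmw.*.*.*.runtime.vm.dead", 0),
    ]
    print(json.dumps(build_dashboard("Actionable alerts", rules), indent=2))
```

Keeping the rules in one list makes it easy to review thresholds in a single place and regenerate the dashboard when the list changes.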

rschitz avatar Jan 17 '23 14:01 rschitz

Thanks @rschitz

Btw, below are some of the event alerts that sound important. I couldn't find license expiry events or trigger events anywhere. Any idea whether those are pulled and, if so, where to find them?

              "target": "vi.*.vi.exec.ExEvent.*.*.com_vmware_vc_ha_VmRestartedByHAEvent",
              "target": "vi.*.vi.exec.ExEvent.*.*.*HA_Cluster*Event",
              "target": "vi.*.vi.exec.ExEvent.*.*.com_vmware_vc_HA_HostAgentErrorEvent",
              "target": "vi.*.vi.exec.ExEvent.*.*.msg*",
              "target": "vmw.*.*.*.quickstats.drs.DrsScore"
              "target": "vi.*.vi.exec.ExEvent.*.*.esx_audit_maintenancemode_*",
              "target": "vi.*.vi.esx.ExEvent.*.*.esx_audit*reason*",
              "target": "vi.*.vi.exec.ExEvent.*.*.esx_clear*restored",
              "target": "vi.*.vi.exec.ExEvent.*.*.esx_*_restored",
              "target": "vi.*.vi.exec.ExEvent.*.*.esx_problem_net_vmnic_link*",
              "target": "vi.*.vi.exec.ExEvent.*.*.Host*ion*Failed*Event",
              "target": "vi.*.vi.exec.ExEvent.*.*.HostSyncFailedEvent",
              "target": "vi.*.vi.exec.ExEvent.*.*.esx*los?",
              "target": "vi.*.vi.exec.ExEvent.*.*.esx_problem_vm_kill_unexpected_fault_failure_2",
              "target": "vi.*.vi.exec.ExEvent.*.*.HostConnectionLostEvent",
              "target": "vi.*.vi.exec.ExEvent.*.*.esx_problem_dhclient_lease_none",
              "target": "vi.*.vi.exec.ExEvent.*.*.esx_audit_account_locked",
              "target": "vsan.*.*.*.esx.*.vsan.net.port*DropsRate",
              "target": "vsan.*.*.*.esx.*.vsan.net.*PacketsLossRate",
              "target": "vsan.*.*.*.esx.*.vsan.net.tcp*ErrRate",
              "target": "vmw.*.*.*.runtime.vm.dead",
              "target": "esx.*.*.*.vm.*.fatstats.cpu_ready_summation",
              "target": "esx.*.*.*.vm.*.fatstats.netUsage",
              "target": "vi.*.vi.exec.*.*.*.VmMacConflictEvent",
              "target": "vi.*.vi.exec.ExEvent.*.*.NotEnoughResourcesToStartVmEvent",
              "target": "vmw.*.*.*.esx.*.fatstats.overallCpuUtilization",
              "target": "vmw.*.*.*.esx.*.fatstats.overallmemUtilization",
              "target": "vmw.*.*.*.vm.*.quickstats.BalloonedMemory",
              "target": "vmw.*.*.*.vm.*.quickstats.SwappedMemory",
              "target": "vi.*.vi.exec.ExEvent.*.*.com_vmware_vc_HA_InsufficientFailoverLevelEvent",
              "target": "vi.*.vi.exec.ExEvent.*.*.com_vmware_vc_HardwareSensorEvent",
              "target": "vi.*.vi.exec.ExEvent.*.*.BadUsernameSessionEvent",
              "target": "vi.*.vi.exec.ExEvent.*.*.ClusterOvercommittedEvent",
              "target": "vi.*.vi.exec.ExEvent.*.*.*Resources*Event",
              "target": "vsan.*.*.*.esx.*.vsan.disk.capacity.*.percentUsed"
              "target": "vsan.*.*.*.vsan.*.totalRecoveryETA",
              "target": "vi.*.vi.exec.*.*.*.esx_problem_vob_vsan_lsom_diskunhealthy",

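To sanity-check a threshold before wiring it into an alarm, something like the sketch below could help. It assumes the targets above are Graphite metric paths and that the Graphite render API is reachable with `format=json`. The base URL, the helper names and the threshold of 80 are placeholders, not anything SexiGraf ships.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen


def render_url(base_url, target, window="-5min"):
    """Build a Graphite render API URL that returns JSON for one target."""
    query = urlencode({"target": target, "from": window, "format": "json"})
    return f"{base_url.rstrip('/')}/render?{query}"


def fetch(url):
    """Fetch and decode the render API response (a list of series)."""
    with urlopen(url, timeout=10) as response:
        return json.load(response)


def breaches(series_list, threshold):
    """Return {series name: latest non-null value} for series above threshold.

    Each series is expected to look like
    {"target": "...", "datapoints": [[value, timestamp], ...]}.
    """
    hits = {}
    for series in series_list:
        values = [value for value, _ts in series["datapoints"] if value is not None]
        if values and values[-1] > threshold:
            hits[series["target"]] = values[-1]
    return hits


if __name__ == "__main__":
    # Hypothetical host; replace with the address of your own Graphite endpoint.
    url = render_url(
        "http://graphite.example",
        "vsan.*.*.*.esx.*.vsan.disk.capacity.*.percentUsed",
    )
    print(breaches(fetch(url), threshold=80))
```

Because a wildcard target can expand to many series, `breaches` reports each offending series by name rather than a single yes/no answer.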
danilansible avatar Feb 06 '23 10:02 danilansible