Monitoring VMware

VMware monitoring methodology

VMware environments are monitored via web services, so the ServiceNav Box must be able to reach these web services over HTTPS. Both ESX hosts and vCenter servers provide them.
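Before adding a host, it can help to confirm that the ServiceNav Box can reach the VMware web service over HTTPS. The following is a minimal sketch, not ServiceNav's own check. It assumes the usual vSphere web services endpoint at `/sdk` on TCP port 443, and the helper names `sdk_url` and `can_reach` are hypothetical. It only tests TCP connectivity; it does not validate certificates or credentials.

```python
import socket


def sdk_url(host: str, port: int = 443) -> str:
    """Build the web service URL for an ESX host or vCenter server.

    Assumption: the endpoint is exposed at /sdk, the usual vSphere default.
    """
    if port == 443:
        return f"https://{host}/sdk"
    return f"https://{host}:{port}/sdk"


def can_reach(host: str, port: int = 443, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False
```

Run from the ServiceNav Box (or a machine on the same network segment), a call such as `can_reach("vcenter.example.local")`, where the host name is a placeholder, indicates whether the HTTPS port is open before any monitoring account is configured.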

To retrieve information from these web services, it is recommended to create a read-only user on the ESX hosts and/or vCenter console.

Prerequisite: Creating a VMware monitoring user

Once the user is created in ESX or vCenter, go to the “Configuration” menu in ServiceNav, then “General”, then “Monitoring accounts”, and add the VMware user created above.

Important: If ESX hosts are managed by vCenter, datastores must be monitored by deploying the “VMware-Datacenter-Datastore” checkpoint for each vCenter-managed datastore to monitor.

In vCenter, datastore usage data is updated every 30 minutes. On an ESX server, usage is updated at random intervals, so monitoring is less reliable.

vCenter monitoring

Create a monitoring account in VMware vCenter

ESX(i) monitoring

Configuration

Create a monitoring account in VMware

Checkpoints

VMware-ESX-CPU: Measures CPU load as a percentage and notifies when a threshold is exceeded (warning and critical).
VMware-ESX-DiskIO-Read: Measures disk read latency on ESX/ESXi and notifies when a threshold is exceeded.
VMware-ESX-DiskIO-Write: Measures disk write latency on ESX/ESXi and notifies when a threshold is exceeded.
VMware-ESX-Hardware: Checks the condition of the hardware components (fans, drives, power supplies, etc.) of a VMware ESX server. Prerequisites: VMware plugins for the specific manufacturer (HP Insight Manager, Dell OpenManage, IBM IMM).
VMware-ESX-NetUsage: Measures the usage of the network interfaces on an ESX server.
VMware-ESX-RAM: Measures the RAM usage of an ESX server and notifies when a threshold is exceeded.
VMware-ESX-Runtime_Issues: Displays a list of all current incidents on the virtualization host.
VMware-ESX-Runtime_status: Indicates the general state of the virtualization host with one of four status codes: grey, green, red, or yellow.
VMware-ESX-Services: Checks the status of the ESXi services specified in the settings and enters the CRITICAL state when at least one service in this list is not started.
VMware-ESX-Services-WithExclusion: Checks the status of the ESXi services that must be started (except those specified in the settings) and enters the CRITICAL state when at least one service is not started.
VMware-ESX-SWAP: Measures swap space usage on an ESX/ESXi server and notifies when a threshold is exceeded.
VMware-ESX-VM-List: Lists the virtual machines on the ESX host and indicates their status (UP / DOWN). The status of the checkpoint does not change when the status of a virtual machine changes.
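
The difference between the two service checkpoints (a list of services to check versus a list of services to ignore) can be sketched as follows. This is an illustration only, not the plugins' actual code: the function names, the dictionary of service states, and the service names in the example are assumptions, and the exclusion variant here simply treats every reported service as one that must be started.

```python
from typing import Dict, Iterable

OK = "OK"
CRITICAL = "CRITICAL"


def check_services(running: Dict[str, bool], required: Iterable[str]) -> str:
    """Inclusion mode: CRITICAL if any service in `required` is not started.

    A required service that is absent from `running` counts as not started.
    """
    stopped = [name for name in required if not running.get(name, False)]
    return CRITICAL if stopped else OK


def check_services_with_exclusion(running: Dict[str, bool],
                                  excluded: Iterable[str]) -> str:
    """Exclusion mode: every reported service must be started, except `excluded`."""
    ignored = set(excluded)
    stopped = [name for name, started in running.items()
               if name not in ignored and not started]
    return CRITICAL if stopped else OK


# Hypothetical service states reported by a host.
states = {"ntpd": True, "sshd": False, "vpxa": True}
print(check_services(states, ["ntpd", "vpxa"]))         # sshd is not in the list
print(check_services_with_exclusion(states, ["sshd"]))  # sshd is ignored
print(check_services_with_exclusion(states, []))        # sshd counts as stopped
```

The inclusion variant suits hosts where only a few services matter; the exclusion variant suits hosts where almost everything should run and only known exceptions need to be listed.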

VM monitoring

Configuration

Create a monitoring account: VMware vCenter
Provide the name of the vCenter server
VMs are managed through vCenter, so the device name in ServiceNav must be the same as the VM name in VMware.
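
Since the device name must be identical on both sides, a quick comparison of the two name lists can catch mismatches before checkpoints are deployed. This is a minimal sketch, assuming both lists have been exported by hand. The function name and the sample names are hypothetical, and the exact, case-sensitive comparison is an assumption.

```python
from typing import Iterable, List


def names_missing_in_vmware(servicenav_names: Iterable[str],
                            vmware_names: Iterable[str]) -> List[str]:
    """Return ServiceNav device names that have no identical VM name in VMware.

    Assumption: names are compared exactly (case-sensitive, no trimming).
    """
    known = set(vmware_names)
    return sorted(name for name in servicenav_names if name not in known)


# Hypothetical exports from each system.
servicenav = ["web01", "db01", "App01"]
vmware = ["web01", "db01", "app01"]
print(names_missing_in_vmware(servicenav, vmware))
```

In this example only `App01` is reported, because its capitalization differs from the VMware name.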

VMware-Datacenter-VM_ActiveRAM: Measures the active memory consumed by a VM by querying vCenter and notifies when thresholds, expressed in MB, are exceeded.
VMware-Datacenter-VM_Balooning: Measures the memory used by the VM memory control driver (ballooning) by querying vCenter and notifies when thresholds, expressed in MB, are exceeded.
VMware-Datacenter-VM-ConfigIssues: Detects VM configuration problems by querying vCenter.
VMware-Datacenter-VM-ConnectionState: Collects the connection status of a VM by querying vCenter and changes state if the status is not OK.
VMware-Datacenter-VM_CPU: Measures the CPU usage of a VM by querying vCenter and notifies when a threshold is exceeded.
VMware-Datacenter-VM_CPUReady: Measures the CPU ready time of a VM, in milliseconds, by querying vCenter and notifies when a threshold is exceeded. CPU ready time is the time during which the virtual CPU of a VM is ready to run but is waiting for a physical CPU.
VMware-Datacenter-VM_CPUWait: Measures the CPU wait time of a VM, in milliseconds, by querying vCenter and notifies when a threshold is exceeded. CPU wait time is the time during which the virtual CPU of a VM is waiting for resources, plus idle time.
VMware-Datacenter-VM_DiskIO-Read: Measures the disk read I/O of a VM by querying vCenter and notifies when thresholds, expressed in MB/s, are exceeded.
VMware-Datacenter-VM_DiskIO-Write: Measures the disk write I/O of a VM by querying vCenter and notifies when thresholds, expressed in MB/s, are exceeded.
VMware-Datacenter-VM-GuestStatus: Gets the operating status of the guest OS in a VM by querying vCenter and changes state if the status is not OK.
VMware-Datacenter-VM_NetUsage: Measures the network utilization of a VM by querying vCenter and notifies when thresholds, expressed in kilobytes per second, are exceeded.
VMware-Datacenter-VM_OverheadRAM: Measures the additional memory used by a VM due to the virtualization of resources, by querying vCenter, and notifies when thresholds, expressed in MB, are exceeded.
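
Most of the checkpoints above compare a measured value with a warning threshold and a critical threshold. The sketch below shows one common way such a comparison works. It is an assumption about the general pattern, not the plugins' actual implementation. The function names, the byte-to-MB conversion (1 MB taken as 1024 * 1024 bytes), and the sample values are illustrative.

```python
OK, WARNING, CRITICAL = "OK", "WARNING", "CRITICAL"


def bytes_to_mb(value_bytes: float) -> float:
    """Convert bytes to MB, assuming 1 MB = 1024 * 1024 bytes."""
    return value_bytes / (1024 * 1024)


def evaluate(value: float, warning: float, critical: float) -> str:
    """Map a measurement to a state; higher values are worse.

    Assumption: a threshold is 'exceeded' when the value is strictly greater.
    """
    if warning > critical:
        raise ValueError("warning threshold must not be above critical")
    if value > critical:
        return CRITICAL
    if value > warning:
        return WARNING
    return OK


# Hypothetical sample: 3 GiB of active RAM against 2048 MB / 4096 MB thresholds.
active_mb = bytes_to_mb(3 * 1024 ** 3)
print(active_mb, evaluate(active_mb, warning=2048, critical=4096))
```

Whatever the unit (MB, MB/s, kilobytes per second, milliseconds), the thresholds must be entered in the same unit as the measurement for the comparison to be meaningful.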
