Name

ocf:ethmonitor — Monitors network interfaces

Synopsis

OCF_RESKEY_interface=string OCF_RESKEY_name=string [OCF_RESKEY_multiplier=integer] [OCF_RESKEY_repeat_count=integer] [OCF_RESKEY_repeat_interval=integer] [OCF_RESKEY_pktcnt_timeout=integer] [OCF_RESKEY_arping_count=integer] [OCF_RESKEY_arping_timeout=integer] [OCF_RESKEY_arping_cache_entries=integer] ethmonitor [start | stop | status | monitor | meta-data | validate-all]

Description

Monitor the vitality of a local network interface. You may setup this RA as a clone resource to monitor the network interfaces on different nodes, with the same interface name. This is not related to the IP adress or the network on which a interface is configured. You may use this RA to move resources away from a node, which has a faulty interface or prevent moving resources to such a node. This gives you independend control of the resources, without involving cluster intercommunication. But it requires your nodes to have more than one network interface. The resource configuration requires a monitor operation, because the monitor does the main part of the work. In addition to the resource configuration, you need to configure some location contraints, based on a CIB attribute value. The name of the attribute value is configured in the 'name' option of this RA. Example constraint configuration: location loc_connected_node my_resource_grp rule ="rule_loc_connected_node" -INF: ethmonitor eq 0 The ethmonitor works in 3 different modes to test the interface vitality. 1. call ip to see if the link status is up (if link is down -> error) 2. call ip an watch the RX counter (if packages come around in a certain time -> success) 3. call arping to check wether any of the IPs found in the lokal ARP cache answers an ARP REQUEST (one answer -> success) 4. return error

Supported Parameters

OCF_RESKEY_interface=Network interface name

The name of the network interface which should be monitored (e.g. eth0).

OCF_RESKEY_name=Attribute name

The name of the CIB attribute to set. This is the name to be used in the constraints. Defaults to "ethmonitor-'interface_name'".

OCF_RESKEY_multiplier=Multiplier for result variable

Multiplier for the value of the CIB attriobute specified in parameter name.

OCF_RESKEY_repeat_count=Monitor repeat count

Specify how often the interface will be monitored, before the status is set to failed. You need to set the timeout of the monitoring operation to at least repeat_count * repeat_interval

OCF_RESKEY_repeat_interval=Monitor repeat interval in seconds

Specify how long to wait in seconds between the repeat_counts.

OCF_RESKEY_pktcnt_timeout=packet counter timeout

Timeout for the RX packet counter. Stop listening for packet counter changes after the given number of seconds.

OCF_RESKEY_arping_count=Number of arpings per IP

Number of ARP REQUEST packets to send for every IP. Usually one ARP REQUEST (arping) is send

OCF_RESKEY_arping_timeout=Timeout for arpings per IP

Time in seconds to wait for ARP REQUESTs (all packets of arping_count). This is to limit the time for arp requests, to be able to send requests to more than one node, without running in the monitor operation timeout.

OCF_RESKEY_arping_cache_entries=Number of ARP cache entries to try

Maximum number of IPs from ARP cache list to check for ARP REQUEST (arping) answers. Newest entries are tried first.