Configuring Resource Monitoring

Although the High Availability Extension can detect a node failure, it also has the ability to detect when an individual resource on a node has failed. If you want to ensure that a resource is running, you must configure resource monitoring for that resource. Resource monitoring consists of specifying a timeout and/or start delay value, and an interval. The interval tells the CRM how often it should check the resource status.

Procedure 4.8. Adding or Modifying Monitor Operations

  1. Start the Linux HA Management Client and log in to the cluster as described in Section 4.1, “Linux HA Management Client”.

  2. In the Linux HA Management Client main window, click Resources in the left pane to see the resources already configured for the cluster.

  3. In the right pane, select the resource to modify and click Edit. The next window shows the basic resource parameters and the meta attributes, instance attributes and operations already defined for that resource.

  4. To add a new monitor operation, select the respective tab and click Add.

    To modify an existing operation, select the respective entry and click Edit.

  5. Enter a unique ID for the monitor operation. When modifying existing monitor operations, the ID is already defined and is displayed in the configuration dialog.

  6. In Name, select the action to perform, for example monitor, start, or stop.

  7. In the Interval field, enter a value in seconds.

  8. In the Timeout field, enter a value in seconds. After the specified timeout period, the operation will be treated as failed. The PE will decide what to do or execute what you specified in the On Fail field of the monitor operation.

  9. If needed, set optional parameters, like On Fail (what do if this action ever fails?) or Requires (what conditions need to be satisfied before this action occurs?).

  10. If all parameters are set according to your wishes, click OK to finish the configuration of that resource. The configuration dialog is closed and the main window shows the modified resource.

If you do not configure resource monitoring, resource failures after a successful start will not be communicated, and the cluster will always show the resource as healthy.

If the resource monitor detects a failure, the following actions will take place: