SUSE Linux Enterprise High Availability Extension

High Availability Guide

Legal Notice

Contents

About This Guide
1. Feedback
2. Documentation Conventions
I. Installation and Setup
1. Conceptual Overview
1.1. Product Features
1.2. Product Benefits
1.3. Cluster Configurations
1.4. Architecture
1.5. What's New?
2. Getting Started
2.1. Hardware Requirements
2.2. Software Requirements
2.3. Shared Disk System Requirements
2.4. Preparations
2.5. Overview: Installing and Setting Up a Cluster
3. Installation and Basic Setup with YaST
3.1. Installing the High Availability Extension
3.2. Initial Cluster Setup
3.3. Bringing the Cluster Online
II. Configuration and Administration
4. Configuring Cluster Resources with the GUI
4.1. Linux HA Management Client
4.2. Creating Cluster Resources
4.3. Creating STONITH Resources
4.4. Configuring Resource Constraints
4.5. Specifying Resource Failover Nodes
4.6. Specifying Resource Failback Nodes (Resource Stickiness)
4.7. Configuring Resource Monitoring
4.8. Starting a New Cluster Resource
4.9. Removing a Cluster Resource
4.10. Configuring a Cluster Resource Group
4.11. Configuring a Clone Resource
4.12. Migrating a Cluster Resource
4.13. For More Information
5. Configuring Cluster Resources From Command Line
5.1. Command Line Tools
5.2. Debugging Your Configuration Changes
5.3. Creating Cluster Resources
5.4. Creating a STONITH Resource
5.5. Configuring Resource Constraints
5.6. Specifying Resource Failover Nodes
5.7. Specifying Resource Failback Nodes (Resource Stickiness)
5.8. Configuring Resource Monitoring
5.9. Starting a New Cluster Resource
5.10. Removing a Cluster Resource
5.11. Configuring a Cluster Resource Group
5.12. Configuring a Clone Resource
5.13. Migrating a Cluster Resource
5.14. Testing with Shadow Configuration
5.15. For More Information
6. Setting Up a Simple Testing Resource
6.1. Configuring a Resource with the GUI
6.2. Manual Configuration of a Resource
7. Adding or Modifying Resource Agents
7.1. STONITH Agents
7.2. Writing OCF Resource Agents
8. Fencing and STONITH
8.1. Classes of Fencing
8.2. Node Level Fencing
8.3. STONITH Configuration
8.4. Monitoring Fencing Devices
8.5. Special Fencing Devices
8.6. For More Information
III. Storage and Data Replication
9. Oracle Cluster File System 2
9.1. Features and Benefits
9.2. Management Utilities and Commands
9.3. OCFS2 Packages
9.4. Creating an OCFS2 Volume
9.5. Mounting an OCFS2 Volume
9.6. Additional Information
10. Cluster LVM
10.1. Configuration of cLVM
10.2. Configuring Eligible LVM2 Devices Explicitly
10.3. For More Information
11. Distributed Replicated Block Device (DRBD)
11.1. Installing DRBD Services
11.2. Configuring the DRBD Service
11.3. Testing the DRBD Service
11.4. Troubleshooting DRBD
11.5. Additional Information
IV. Troubleshooting and Reference
12. Troubleshooting
12.1. Installation Problems
12.2. Debugging a HA Cluster
12.3. FAQs
12.4. Fore More Information
13. Cluster Management Tools
cibadmin — Provides direct access to the cluster configuration
crmadmin — controls the Cluster Resource Manager
crm_attribute — Allows node attributes and cluster options to be queried, modified and deleted
crm_diff — identify changes to the cluster configuration and apply patches to the configuration files
crm_failcount — Manage the counter recording each resource's failures
crm_master — Manage a master/slave resource's preference for being promoted on a given node
crm_mon — monitor the cluster's status
crm_node — Lists the members of a cluster
crm_resource — Perform tasks related to cluster resources
crm_shadow — Perform Configuration Changes in a Sandbox Before Updating The Live Cluster
crm_standby — manipulate a node's standby attribute to determine whether resources can be run on this node
crm_verify — check the CIB for consistency
14. Cluster Resources
14.1. Supported Resource Agent Classes
14.2. OCF Return Codes
14.3. Resource Options
14.4. Resource Operations
14.5. Instance Attributes
15. HA OCF Agents
ocf:anything_ra — anything
ocf:apache — Apache web server
ocf:AudibleAlarm — AudibleAlarm resource agent
ocf:ClusterMon — ClusterMon resource agent
ocf:db2 — db2 resource agent
ocf:Delay — Delay resource agent
ocf:drbd — This resource agent manages a Distributed Replicated Block Device (DRBD) object as a master/slave resource. DRBD is a mechanism for replicating storage; please see the documentation for setup details.
ocf:Dummy — Dummy resource agent
ocf:eDir88 — eDirectory resource agent
ocf:Filesystem — Filesystem resource agent
ocf:ICP — ICP resource agent
ocf:ids — OCF resource agent for the IBM's database server called Informix Dynamic Server (IDS)
ocf:IPaddr2 — Manages virtual IPv4 addresses
ocf:IPaddr — Manages virtual IPv4 addresses
ocf:IPsrcaddr — IPsrcaddr resource agent
ocf:IPv6addr — manages IPv6 alias
ocf:iscsi — iscsi resource agent
ocf:Ldirectord — Wrapper OCF Resource Agent for ldirectord
ocf:LinuxSCSI — LinuxSCSI resource agent
ocf:LVM — LVM resource agent
ocf:MailTo — MailTo resource agent
ocf:ManageRAID — Manages RAID devices
ocf:ManageVE — OpenVZ VE resource agent
ocf:mysql — MySQL resource agent
ocf:nfsserver — nfsserver
ocf:oracle — oracle resource agent
ocf:oralsnr — oralsnr resource agent
ocf:pgsql — pgsql resource agent
ocf:pingd — pingd resource agent
ocf:portblock — portblock resource agent
ocf:Pure-FTPd — OCF Resource Agent compliant FTP script.
ocf:Raid1 — RAID1 resource agent
ocf:Route — Manages network routes
ocf:rsyncd — OCF Resource Agent compliant rsync daemon script.
ocf:SAPDatabase — SAP database resource agent
ocf:SAPInstance — SAP instance resource agent
ocf:scsi2reserve — scsi-2 reservation
ocf:SendArp — SendArp resource agent
ocf:ServeRAID — ServeRAID resource agent
ocf:sfex — SF-EX resource agent
ocf:SphinxSearchDaemon — searchd resource agent
ocf:Squid — The RA of Squid
ocf:Stateful — Example stateful resource agent
ocf:SysInfo — SysInfo resource agent
ocf:tomcat — tomcat resource agent
ocf:VIPArip — Virtual IP Address by RIP2 protocol
ocf:VirtualDomain — Manages virtual domains
ocf:WAS6 — WAS6 resource agent
ocf:WAS — WAS resource agent
ocf:WinPopup — WinPopup resource agent
ocf:Xen — Manages Xen DomUs
ocf:Xinetd — Xinetd resource agent
V. Appendix
A. GNU Licenses
A.1. GNU General Public License
A.2. GNU Free Documentation License
Terminology

List of Figures

1.1. Three-Server Cluster
1.2. Three-Server Cluster after One Server Fails
1.3. Typical Fibre Channel Cluster Configuration
1.4. Typical iSCSI Cluster Configuration
1.5. Typical Cluster Configuration Without Shared Storage
1.6. Architecture
4.1. Connecting to the Cluster
4.2. Linux HA Management Client - Main Window
4.3. Linux HA Management Client - Constraints
4.4. Linux HA Management Client - Groups
4.5. Group Resource

List of Tables

9.1. OCFS2 Utilities
13.1. Overview of Internal Commands
14.1. Failure Recovery Types
14.2. OCF Return Codes
14.3. Options for a Primitive Resource
14.4. Resource Operations

List of Examples

4.1. Resource Group for a Web Server
8.1. Testing Configuration
8.2. Testing Configuration
8.3. Testing Configuration
8.4. Configuration of an IBM RSA Lights-out Device
8.5. Configuration of an UPS Fencing Device
12.1. Stopped Resources