The software modifications from version to version are outlined in detail in the following sections. This summary indicates, for example, whether basic settings have been completely reconfigured, configuration files have been moved to other places, or other significant changes happened.
With SUSE Linux Enterprise Server 11, the cluster stack has changed from Heartbeat to OpenAIS. OpenAIS implements an industry standard API, the Application Interface Specification (AIS), published by the Service Availability Forum. The cluster resource manager from SUSE Linux Enterprise Server 10 has been retained but has been significantly enhanced, ported to OpenAIS and is now known as Pacemaker.
For more details what changed in the High Availability components from SUSE® Linux Enterprise Server 10 SP3 to SUSE Linux Enterprise Server 11, refer to the following sections.
The High Availability Extension now comes with the concept of a migration threshold and
failure timeout. You can define a number of failures for resources,
after which they will migrate to a new node. By default, the node
will no longer be allowed to run the failed resource until the
administrator manually resets the resource’s failcount. However it
is also possible to expire them by setting the resource’s
failure-timeout option.
You can now set global defaults for resource options and operations.
Often it is desirable to preview the effects of a series of changes before updating the configuration atomically. You can now create a “shadow” copy of the configuration that can be edited with the command line interface, before committing it and thus changing the active cluster configuration atomically.
Rules, instance_attributes, meta_attributes and sets of operations can be defined once and referenced in multiple places.
The CIB now accepts XPath-based create,
modify, delete operations. For
more information, refer to the cibadmin help text.
For creating a set of collocated resources, previously you could
either define a resource group (which could not always accurately
express the design) or you could define each relationship as an
individual constraint—causing a constraint explosion as the
number of resources and combinations grew. Now you can also use an
alternate form of collocation constraints by defining
resource_sets.
Provided Pacemaker is installed on a machine, it is possible to connect to the cluster even if the machine itself is not a part of it.
By default, recurring actions are scheduled relative to when the resource started, but this is not always desirable. To specify a date/time that the operation should be relative to, set the operation’s interval-origin. The cluster uses this point to calculate the correct start-delay such that the operation will occur at origin + (interval * N).
All resource and cluster options now use dashes (-) instead of
underscores (_). For example, the master_max meta
option has been renamed to master-max.
master_slave Resource
The master_slave resource has been renamed to
master. Master resources are a special type of
clone that can operate in one of two modes.
The attributes container tag has been removed.
The pre-req operation field has been renamed
requires.
All operations must have an interval. For start/stop actions the
interval must be set to 0 (zero).
The attributes of collocation and ordering constraints were renamed for clarity.
The resource-failure-stickiness cluster option has
been replaced by the migration-threshold cluster
option. See also Migration Threshold and Failure Timeouts.
The arguments for command-line tools have been made consistent. See also Naming Conventions for Resource and Custer Options.
The cluster configuration is written in XML. Instead of a Document
Type Definition (DTD), now a more powerful RELAX NG schema is
used to define the pattern for the structure and content.
libxml2 is used as parser.
id Fields
id fields are now XML IDs which have the following
limitations:
IDs cannot contain colons.
IDs cannot begin with a number.
IDs must be globally unique (not just unique for that tag).
Some fields (such as those in constraints that refer to resources) are IDREFs. This means that they must reference existing resources or objects in order for the configuration to be valid. Removing an object which is referenced elsewhere will therefor fail.
It is no longer possible to set resource meta-options as top-level attributes. Use meta attributes instead. See also crm_resource(8).
Resource and operation defaults are no longer read from
crm_config.
The main cluster configuration file has changed from
/etc/ais/openais.conf to
/etc/corosync/corosync.conf. Both files are very
similar. When upgrading from SUSE Linux Enterprise High Availability Extension 11 to SP1, a script takes
care of the minor differences between those files. For more
information about the relationship between OpenAIS and Corosync, see
.
In order to migrate existing clusters with minimal downtime, SUSE Linux Enterprise High Availability Extension allows you to perform a “rolling upgrade” from SUSE Linux Enterprise High Availability Extension 11 to 11 SP1. The cluster is still online while you upgrade one node after the other.
For easier cluster deployment, AutoYaST allows you to clone existing nodes. AutoYaST is a system for installing one or more SUSE Linux Enterprise systems automatically and without user intervention, using an AutoYaST profile that contains installation and configuration data. The profile tells AutoYaST what to install and how to configure the installed system to get a completely ready-to-use system in the end. This profile can be used for mass deployment in different ways.
SUSE Linux Enterprise High Availability Extension ships with Csync2, a tool for replication of configuration files across all nodes in the cluster. It can handle any number of hosts and it is also possible to synchronize files among certain subgroups of hosts only. Use YaST to configure the hostnames and the files that should be synchronized with Csync2.
The High Availability Extension now also includes the HA Web Konsole, a Web-based user interface for management tasks. It allows you to monitor and administer your Linux cluster also from non-Linux machines. It is also an ideal solution in case your system does not provide or allow a graphical user interface.
When using the command line interface to create and configure resources, you can now choose from various resource templates for quicker and easier configuration.
By defining the capacity a certain node provides, the capacity a certain resource requires and by choosing one of several placement strategies in the cluster, resources can be placed according to their load impact to prevent decrease of cluster performance.
It is now possible to create disaster-resilient storage configurations
from two independent SANs, using
cmirrord.
For easier migration from GFS2 to OCFS2, you can mount your GFS2 file systems in read-only mode to copy the data to an OCFS2 file system. OCFS2 is fully supported by SUSE Linux Enterprise High Availability Extension.
If redundant rings are configured, OCFS2 and DLM can automatically use redundant communication paths via SCTP, independent of network device bonding.
For additional layers of security in protecting your storage from data
corruption, you can use a combination of IO fencing (with the
external/sbd fencing device) and the
sfex resource agent
to ensure exclusive storage access.
The High Availability Extension now supports CTDB, the cluster implementation of the trivial database. This allows you configure a clustered Samba server—providing an High Availability solution also for heterogeneous environments.
The new module allows configuration of kernel-based load balancing
with a graphical user interface. It is a front-end for
ldirectord, a
user-space daemon for managing Linux Virtual Server and monitoring the real servers.