Veritas Cluster Server User's Guide 5.0 RU3 - Linux for IBM Power

Article:DOC2390  |  Created: 2010-08-11  |  Updated: 2010-08-11  |  Article URL http://www.symantec.com/docs/DOC2390
Article Type
Documentation

Product(s)

Description



Table of Contents:
Clustering concepts and terminology
Introducing Veritas Cluster Server
What is a VCS cluster?
How VCS detects failure
How VCS ensures application availability
About switchover and failover
Can my application be clustered?
Defined start, stop, and monitor procedures
Ability to restart the application in a known state
External data storage
Licensing and host name issues
Physical components of VCS
Nodes
Shared storage
Networking
Logical components of VCS
Resources and resource dependencies
Categories of resources
Resource types
Service groups
Types of service groups
Failover service groups
Parallel service groups
Hybrid service groups
About the ClusterService group
About agents in VCS
Agent functions
Agent classifications
About the VCS agent framework
About cluster control, communications, and membership
About the high availability daemon (HAD)
About the HostMonitor daemon
About Group Membership Services/Atomic Broadcast (GAB)
About Low Latency Transport (LLT)
About the I/O fencing module
About security services
Components for administering VCS
Cluster Management Console
Cluster Manager (Java console)
VCS command lineerface (CLI)
Putting the pieces together
About cluster topologies
Basic failover configurations
Asymmetric or Active/Passive configuration
Symmetric or Active/Active configuration
N-to-1 configuration
Advanced failover configurations
N + 1 configuration
N-to-N configuration
Cluster topologies and storage configurations
Basic shared storage cluster
Campus, or Metropolitan, shared storage cluster
Shared nothing cluster
Replicated data cluster
Global cluster
VCS configuration concepts
About configuring VCS
About the VCS configuration language
About the main.cf file
About the SystemList attribute
Initial configuration
The types.cf file
About VCS attributes
Attribute data types
Attribute dimensions
Attributes and cluster objects
Attribute scope across systems: global and local attributes
Attribute life: temporary attributes
Size limitations for VCS objects
About VCS keywords and reserved words
VCS environment variables
Defining VCS environment variables
Administration-Putting VCS to work
About the VCS user privilege model
About VCS user privileges and roles
About VCS privilege levels
About user roles in VCS
About the hierarchy in VCS roles
User privileges for CLI commands
User privileges in global clusters
User privileges for clusters running in secure mode
How administrators assign roles to users
User privileges for OS user groups for clusters running in secure mode
About VCS privileges for users with multiple roles
Administering the cluster from the Cluster Management Console
About Veritas Cluster Management Console
Supported browsers
Configuring the Cluster Management Console manually
Sample configuration
Logging in to the Cluster Management Console
Logging out of the Cluster Management Console
Overview of the Cluster Management Console
Cluster Management Console layout
About the main tab bar
About the search bar
About the task pane
About the status pane
About quick navigation links
Icon conventions
Sorting the contents of a data table
Viewing multiple pages of a data table
Controlling view updates
Update (refresh) mode settings and icons
Administering users
About user listing tables
Sorting the list
Adding and deleting a user
Administering a cluster
Opening the configuration
Saving the configuration
Modifying cluster attributes
Administering service groups
Adding a service group to a cluster
Deleting a service group from a cluster
Modifying a service group
Modifying the system list of a service group
Bringing a service group online
Taking a service group offline
Switching a service group
Freezing a service group
Unfreezing a service group
Enabling a service group
Disabling a service group
Linking service groups
Viewing service group dependencies
Unlinking service groups
Adding systems to a service group
Deleting systems from a service group
Faulting a service group
Clearing a faulted service group
Modifying service group attributes
Flushing a service group
Adding a RemoteGroup resource from Cluster Management Console
Administering resources
Adding a resource to a service group
Overriding resource attributes
Enabling and disabling resources
Bringing a resource online
Taking a resource offline
Taking a resource offline and propagating the command
Faulting a resource
Clearing a faulted resource
Probing a resource
Deleting a resource from a service group
Linking resources
Viewing resource dependencies
Unlinking resources
Modifying a resource attribute
Invoking a resource action
Refreshing the ResourceInfo attribute
Clearing the ResourceInfo attribute
Administering resource types
Modifying resource type attributes
Administering systems
Freezing a System
Unfreezing a system
Modifying system attributes
Administering attributes
Viewing logs
Conducting a search
Administering the cluster from Cluster Manager (Java console)
About the Cluster Manager (Java Console)
Disability compliance
Getting started
Setting the display on UNIX systems
Using Java Console with secure shell
Starting Cluster Manager (Java console)
Reviewing components of the Java Console
Icons in the Java Console
About Cluster Monitor
Cluster monitor toolbar
Cluster monitor panels
Monitoring the cluster connection with Cluster Monitor
Monitoring VCS objects with Cluster Monitor
Expanding and collapsing the Cluster Monitor display
Customizing the Cluster Manager display
About Cluster Explorer
Cluster Explorer toolbar
Cluster Explorer configuration tree
Cluster Explorer view panel
Status view
Properties view
Service Group view
Resource view
Moving and linking icons in Service Group and Resource views
Zooming in on Service Group and Resource views
System Connectivity view
Remote Cluster Status view
Accessing additional features of the Java Console
Template view
System Manager
User Manager
Command Center
Configuration wizard
Notifier Resource Configuration wizard
Remote Group Resource Configuration Wizard
Cluster query
Logs
Server and user credentials
Administering Cluster Monitor
Configuring a new cluster panel
Modifying a cluster panel configuration
Logging on to and off of a cluster
Logging on to a cluster
Logging off of a cluster
Administering user profiles
Adding a user
Deleting a user
Changing a user password
Changing a user privilege
Assigning privileges for OS user groups for clusters running in secure mode
Administering service groups
Adding a service group
Deleting a service group
Bringing a service group online
Taking a service group offline
Switching a service group
Freezing a service group
Unfreezing a service group
Enabling a service group
Disabling a service group
Autoenabling a service group
Flushing a service group
Linking service groups
Unlinking service groups
Managing systems for a service group
Creating service groups with the configuration wizard
Administering resources
Adding a resource
Adding a RemoteGroup resource from the Java Console
Deleting a resource
Bringing a resource online
Taking a resource offline
Taking a resource offline and propagating the command
Probing a resource
Overriding resource type static attributes
Enabling resources in a service group
Disabling resources in a service group
Clearing a resource
Linking resources
Unlinking resources
Invoking a resource action
Refreshing the ResourceInfo attribute
Clearing the ResourceInfo attribute
Importing resource types
Running HA fire drill from the Java Console
Administering systems
Adding a system
Deleting a system
Freezing a system
Unfreezing a system
Administering clusters
Opening a cluster configuration
Saving a cluster configuration
Saving and closing a cluster configuration
Executing commands
Editing attributes
Querying the cluster configuration
Setting up VCS event notification using the Notifier wizard
Administering logs
Customizing the log display
Resetting the log display
Monitoring alerts
Administering VCS Simulator
Administering the cluster from the command line
About administering VCS from the command line
How VCS identifies the local system
About specifying values preceded by a dash (-)
About the -modify option
Encrypting VCS passwords
Encrypting agent passwords
Encrypting agent passwords using security keys
Privilege requirements generating security keys
Creating secure agent passwords
Granting password encryption privileges to group administrators
Changing the security key
Installing a VCS license
Starting VCS
When VCS Starts
Stopping VCS
Stopping VCS without -force option
Stopping VCS with options other than -force
Controlling the hastop behavior using the EngineShutdown attribute
Additional considerations for stopping VCS
Logging on to VCS
Managing VCS configuration files
About the hacf utility
About multiple versions of .cf files
Verifying a configuration
Scheduling automatic backups for VCS configuration files
Saving a configuration
Setting the configuration to read/write
Formatting configuration files
Taking snapshots of VCS configuration files
Backing up configuration files
Restoring VCS configuration files
Viewing snapshots of configuration files
Viewing files changed after a snapshot
Comparing a file with its snapshot copy
Exporting snapshots
Adding and removing files for snapshots
Excluding files from snapshots
Deleting snapshots
Managing VCS users from the command line
Adding a user
Assigning and removing user privileges
Modifying a user
Deleting a user
Displaying a user
Querying VCS
Querying service groups
Querying resources
Querying resource types
Querying agents
Querying systems
Querying clusters
Querying status
Querying log data files (LDFs)
Using conditional statements to query VCS objects
Administering service groups
Adding and deleting service groups
Modifying service group attributes
About modifying the SystemList attribute
Bringing service groups online
Taking service groups offline
Switching service groups
Freezing and unfreezing service groups
Enabling and disabling service groups
Clearing faulted resources in a service group
Flushing service groups
Linking and unlinking service groups
Administering agents
Administering resources
About adding resources
Adding resources
Deleting resources
Adding, deleting, and modifying resource attributes
Defining attributes as local
Linking and unlinking resources
Bringing resources online
Taking resources offline
Probing a resource
Clearing a resource
Administering systems
Administering clusters
Retrieving version information
Adding and removing systems
Setting cluster attributes from the command line
Initializing cluster attributes in the configuration file
Enabling and disabling Security Services
Administering resource types
Adding, deleting, and modifying resource types
Overriding resource type static attributes
Initializing resource type scheduling and priority attributes
Setting Scheduling and Priority attributes
Using the -wait option in scripts
Running HA fire drills
Administering simulated clusters from the command line
Administering I/O fencing
About the vxfentsthdw utility
General guidelines for using vxfentsthdw
vxfentsthdw options
Testing the shared disks using the -m option
Testing the coordinator disk group using vxfentsthdw -c
Performing non-destructive testing on the disks using the -r option
Testing the shared disks listed in a file using the -f option
Testing all the disks in a diskgroup using the -g option
Testing a disk with existing keys
About the vxfenadm utility
About the registration key format
Verifying the nodes see the same disk
About the vxfenclearpre utility
Removing preexisting keys
About the vxfenswap utility
Replacing I/O fencing coordinator disks when the cluster is online
Replacing the coordinator diskgroup in a cluster that is online
Changing the disk interaction policy in a cluster that is online
Adding disks from a recovered site to the coordinator diskgroup
Refreshing lost keys on coordinator disks
Configuring applications and resources in VCS
About configuring resources and applications
About VCS bundled agents
Storage agents
Network agents
File share agents
Services and Applications agents
VCS infrastructure and support agents
Testing agents
Which agents should I use?
Configuring application service groups on Linux
Prerequisites
Running the wizard
Configuring NFS service groups on Linux
Prerequisites
Running the wizard
Configuring the RemoteGroup agent
About the ControlMode attribute
OnOff
MonitorOnly
OnlineOnly
Example: Configuring a RemoteGroup resource
Service group behavior with the RemoteGroup agent
Bringing the Apache service group online
Unexpected offline of the database service group
Taking the Apache service group offline
Testing resource failover using HA fire drills
About HA fire drills
Running an HA fire drill
Predicting VCS behavior using VCS Simulator
About VCS Simulator
Installing VCS Simulator
Reviewing the installation
Simulator ports
Administering VCS Simulator from the Java Console
Starting VCS Simulator from the Java Console
Creating a simulated cluster
Deleting a cluster
Starting a simulated cluster
Verifying a simulated cluster configuration
Simulating a global cluster configuration
Bringing a system up
Powering off a system
Saving the offline configuration
Simulating a resource fault
Simulating cluster faults in global clusters
Simulating failed fire drills
Administering VCS Simulator from the command line
Starting VCS Simulator from the command line
Administering simulated clusters from the command line
VCS communication and operations
About communications, membership, and data protection in the cluster
About cluster communications
About intra-system communications
About inter-system cluster communications
Group Membership Services/Atomic Broadcast (GAB)
Low Latency Transport (LLT)
About cluster membership
Initial joining of systems to cluster membership
Seeding a new cluster
Ongoing cluster membership
About membership arbitration
Components of membership arbitration
Fencing module
Coordinator disks
How the fencing module starts up
How membership arbitration works
About data protection
SCSI-3 Persistent Reservation
Examples of VCS operation with I/O fencing
About the I/O fencing algorithm
Two system cluster where one system fails
Four system cluster where cluster interconnect fails
How I/O fencing works in different event scenarios
About cluster membership and data protection without I/O fencing
About jeopardy
About Daemon Down Node Alive (DDNA)
Examples of VCS operation without I/O fencing
Four system cluster without a low priority link
Cluster interconnect link failure
Cluster interconnect link failure followed by system failure
All high priority cluster interconnect links fail
Four system cluster with low priority link
Cluster interconnect link failure
Cluster interconnect link failure followed by system failure
All high priority cluster interconnect links fail
Summary of best practices for cluster communications
The role of service group dependencies
About service group dependencies
Dependency links
Dependency categories: online or offline dependencies
Dependency location: local, global, or remote dependencies
Dependency rigidity: soft, firm, or hard dependencies
Dependency limitations
Service group dependency configurations
Failover parent / Failover child
Failover parent / Parallel child
Parallel parent / Failover child
Parallel parent / Parallel child
Group Dependency FAQs
Linking service groups
VCS behavior with service group dependencies
Online operations in group dependencies
Offline operations in group dependencies
Switch operations in group dependencies
Controlling VCS behavior
About VCS behavior on resource faults
About critical and non-critical resources
VCS behavior diagrams
Scenario: Resource with critical parent faults
Scenario: Resource with non-critical parent faults
Scenario: Resource with critical parent fails to come online
Controlling VCS behavior at the service group level
About the AutoRestart attribute
Controlling failover on service group or system faults
Defining failover policies
About system zones
About load-based autostart
Freezing service groups
Controlling Clean behavior on resource faults
Clearing resources in the ADMIN_WAIT state
Controlling fault propagation
Customized behavior diagrams
Scenario: Resource with a critical parent and ManageFaults=NONE
Scenario: Resource with a critical parent and FaultPropagation=0
VCS behavior for resources that support the intentional offline functionality
About the IntentionalOffline attribute
About the ExternalStateChanged attribute
Controlling VCS behavior at the resource level
About resource type attributes that control resource behavior
About the RestartLimit attribute
About the OnlineRetryLimit attribute
About the ConfInterval attribute
About the ToleranceLimit attribute
About the FaultOnMonitorTimeouts attribute
How VCS handles resource faults
VCS behavior when an online resource faults
VCS behavior when a resource fails to come online
VCS behavior after a resource is declared faulted
Disabling resources
When to disable a resource
Limitations of disabling resources
Additional considerations for disabling resources
How disabled resources affect group states
Changing agent file paths and binaries
VCS behavior on loss of storage connectivity
About disk group configuration and VCS behavior
How VCS attributes control behavior on loss of storage connectivity
About VCS behavior when a disk group is disabled
Recommendations to ensure application availability
Service group workload management
Enabling service group workload management
About system capacity and service group load
Static load versus dynamic load
About overload warning
About system limits and service group prerequisites
Using capacity and limits
Sample configurations depicting workload management
System and Service group definitions
Sample configuration: Basic four-node cluster
AutoStart operation
Failure scenario
Cascading failure scenario
Sample configuration: Complex four-node cluster
AutoStart operation
Normal operation
Failure scenario
Cascading failure scenario
Sample configuration: Server consolidation
AutoStart operation
Normal operation
Failure scenario
Cascading failure scenario
Administration-Beyond the basics
VCS event notification
About VCS event notification
Event messages and severity levels
Persistent and replicated message queue
How HAD deletes messages
Components of VCS event notification
The notifier process
Example of notifier command
The hanotify utility
Example of hanotify command
VCS events and traps
Events and traps for clusters
Events and traps for agents
Events and traps for resources
Events and traps for systems
Events and traps for service groups
SNMP-specific files
Trap variables in VCS MIB
severityId
entityType and entitySubType
entityState
Monitoring aggregate events
How to detect service group failover
How to detect service group switch
Detecting complementary events
Configuring notification
VCS event triggers
About VCS event triggers
Using event triggers
List of event triggers
cpuusage event trigger
injeopardy event trigger
loadwarning event trigger
nofailover event trigger
postoffline event trigger
postonline event trigger
preonline event trigger
resadminwait event trigger
resfault event trigger
resnotoff event trigger
resstatechange event trigger
sysoffline event trigger
unable_to_restart_agent event trigger
unable_to_restart_had event trigger
violation event trigger
Cluster configurations for disaster recovery
Connecting clusters- Creating global clusters
How VCS global clusters work
VCS global clusters: The building blocks
Visualization of remote cluster objects
Global service groups
Global cluster management
Wide-area connector process
Wide-area heartbeats
Serialization-The Authority attribute
Authority and AutoStart
Resiliency and Right of wayŽ
VCS agents to manage wide-area failover
DNS agent
RVG agent
RVGPrimary agent
RVGSnapshot agent
The Steward process: Split-brain in two-cluster global clusters
Secure communication in global clusters
Prerequisites for global clusters
Cluster setup
Application setup
Wide-area heartbeats
ClusterService group
Replication setup
Prerequisites for clusters running in secure mode
Setting up a global cluster
Preparing the application for the global environment
Modifying the ClusterService group for global clusters
Configuring replication resources in VCS
Linking the application and replication service groups
Configuring the second cluster
Linking clusters
Configuring the Steward process (optional)
Configuring the global service group
When a cluster faults
Declaring the type of failure
Switching the service group back to the primary
Setting up a fire drill
Configuring the fire drill service group
Verifying a successful fire drill
Scheduling a fire drill
Multi-tiered application support using the RemoteGroup agent in a global environment
Test scenario for a multi-tiered environment
The main.cf file for cluster 1
The main.cf file for cluster 2
The main.cf file for cluster 3
The main.cf file for cluster 4
Administering global clusters from the Cluster Management Console
About creating global service groups
Administering global clusters
Adding a remote cluster
Deleting a remote cluster
Administering global service groups
Converting local service groups to global service groups
Bringing a remote service group online
Taking a remote service group offline
Switching a remote service group
Administering global heartbeats
Adding a global heartbeat
Deleting a global heartbeat
Modifying a global heartbeat
Administering global clusters from Cluster Manager (Java console)
About global clusters
Adding a remote cluster
Deleting a remote cluster
Administering global service groups
Converting local and global groups
Bringing a service group online in a remote cluster
Taking a service group offline in a remote cluster
Switching a service group to a remote cluster
Administering global heartbeats
Adding a global heartbeat
Modifying a global heartbeat
Deleting a global heartbeat
Administering global clusters from the command line
About administering global clusters from the command line
Global querying
Querying global cluster service groups
Querying resources
Querying systems
Querying clusters
Querying status
Querying heartbeats
Administering global service groups
Administering resources
Administering clusters in global clusters
Changing the cluster name
Administering heartbeats
Setting up replicated data clusters
About replicated data clusters
How VCS replicated data clusters work
Setting up a replicated data cluster configuration
Typical replicated data cluster configuration
Setting up replication
Configuring the service groups
Configuring the service group dependencies
Migrating a service group
Switching the service group
Setting up a fire drill
Setting up campus clusters
About campus cluster configuration
VCS campus cluster requirements
Typical VCS campus cluster setup
How VCS campus clusters work
About I/O fencing in campus clusters
Setting up a campus cluster configuration
Preparing to set up a campus cluster configuration
Configuring I/O fencing to prevent data corruption
Configuring VxVM diskgroups for campus cluster configuration
Configuring VCS service group for campus clusters
About fire drill in campus clusters
About the DiskGroupSnap agent
Running a fire drill in a campus cluster
Configuring the fire drill service group
Running a successful fire drill in a campus cluster
Troubleshooting and performance
VCS performance considerations
How cluster components affect performance
Kernel components (GAB and LLT)
The VCS engine (HAD)
How agents impact performance
Monitoring resource type and agent configuration
Additional considerations for agents
The VCS graphical user interfaces
How cluster operations affect performance
Booting a cluster system
When a resource comes online
When a resource goes offline
When a service group comes online
When a service group goes offline
When a resource fails
When a system fails
When a network link fails
When a system panics
Client process failure
Network failure
Quick reopen
When a service group switches over
When a service group fails over
Scheduling class and priority configuration
Priority ranges
Default scheduling classes and priorities
Monitoring CPU usage
VCS agent statistics
Tracking monitor cycle times
VCS attributes enabling agent statistics
About VXFEN tunable parameters
Configuring the VXFEN parameters
Troubleshooting and recovery for VCS
Logging
Message catalogs
Troubleshooting the VCS engine
HAD diagnostics
DNS configuration issues cause GAB to kill HAD
Seeding and I/O Fencing
Preonline IP check
Troubleshooting VCS startup
Troubleshooting service groups
Troubleshooting resources
Troubleshooting I/O fencing
Node is unable to join cluster while another node is being ejected
vxfentsthdw fails when SCSI TEST UNIT READY command fails
Removing existing keys from disks
System panics to prevent potential data corruption
How vxfen driver checks for pre-existing split brain condition
Case 1: system 2 up, system 1 ejected (actual potential split brain)
Case 2: system 2 down, system 1 ejected (apparent potential split brain)
Clearing keys after split brain using vxfenclearpre command
Registered keys are lost on the coordinator disks
Replacing defective disks when the cluster is offline
The vxfenswap utility faults when echo or cat is used in .bashrc file
Troubleshooting notification
Troubleshooting VCS configuration backup and restore
Troubleshooting and recovery for global clusters
Disaster declaration
Lost heartbeats and the inquiry mechanism
VCS alerts
Types of alerts
Managing alerts
Actions associated with alerts
Negating events
Concurrency violation at startup
Troubleshooting licensing
Validating license keys
Licensing error messages
Appendixes
VCS user privileges- administration matrices
About administration matrices
Administration matrices
Agent Operations (haagent)
Attribute Operations (haattr)
Cluster Operations (haclus, haconf)
Service group operations (hagrp)
Heartbeat operations (hahb)
Log operations (halog)
Resource operations (hares)
System operations (hasys)
Resource type operations (hatype)
User operations (hauser)
Cluster and system states
Remote cluster states
Examples of cluster state transitions
System states
Examples of system state transitions
VCS attributes
About attributes and their definitions
Resource attributes
Resource type attributes
Service group attributes
System attributes
Cluster attributes
Heartbeat attributes (for global clusters)
Administering Symantec Web Server
About Symantec Web Server
Getting Started
Configuring ports for VRTSweb
About VRTSweb ports
Retrieving the list of VRTSweb ports
Adding ports for VRTSweb
Deleting ports
Changing the administrative port
Managing VRTSweb SSL certificates
Viewing SSL certificate information
Creating a self-signed SSL certificate
Exporting the SSL certificate to a file
Configuring a CA-signed SSL certificate
Cloning the VRTSweb SSL certificate
Configuring SMTP notification for VRTSweb
Retrieving the name of the configured SMTP server
Setting the SMTP server
Retrieving SMTP settings
Adding an SMTP recipient
Deleting an SMTP recipient
Configuring logging for VRTSweb
Retrieving log levels
Modifying log levels for VRTSweb
Modifying size limit and rollover count for VRTSweb logs
Modifying the maximum heap size for VRTSweb
Accessibility and VCS
About accessibility in VCS
Navigation and keyboard shortcuts
Navigation in the Java Console
Navigation in the Web Console
Support for accessibility settings
Support for assistive technologies
Glossary
Index

Attachments

Veritas Cluster Server User's Guide 5.0 RU3 - Linux for IBM Power
vcs_users_lin_50ru3.pdf (5 MBytes)

Article URL http://www.symantec.com/docs/DOC2390


Terms of use for this information are found in Legal Notices