BUG REPORT: If a clustered server fails over during restore, the restore will hang. Subsequent attempts to restore will fail until action is taken on the master server.

Article:TECH60408  |  Created: 2008-01-09  |  Updated: 2008-01-09  |  Article URL http://www.symantec.com/docs/TECH60408
Article Type
Technical Solution


Environment

Issue



BUG REPORT: If a clustered server fails over during restore, the restore will hang. Subsequent attempts to restore will fail until action is taken on the master server.

Solution



Bug: 1250443

Detail/Symptoms:
When using a clustered media server, if a restore is in progress and the cluster fails over, not only does the restore hang, but any subsequent restore attempts fail to start.  Restores will fail to start even after the hung restore is cancelled.

Log Files:
In the Job Details for the hung restore, messages similar to the following may be seen:
4/29/2008 2:21:38 AM - Error nbjm(pid=2224) nbrb status: RB deallocated orphaned resources
4/29/2008 4:01:47 AM - Warning bprd(pid=3748) Restore must be resumed prior to first image expiration on 5/13/2008 1:35:47 AM

The following Job Details have also been observed:
4/29/2008 4:59:51 AM - Error nbjm(pid=2224) nbrb status: LTID reset media server resources
4/29/2008 5:03:09 AM - Warning bprd(pid=936) Restore must be resumed prior to first image expiration on 5/13/2008 4:16:11 AM

Subsequent restore Job Details will show no progression beyond the following details:
4/29/2008 4:12:37 AM - begin Restore
4/29/2008 4:12:38 AM - 1 images required
4/29/2008 4:12:38 AM - media CRZ007 required
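
When checking several jobs, the two nbrb status messages quoted above can be searched for in Job Details text that has been saved to a file. The following is a minimal sketch only: the function name has_failover_symptom and the file name job_details.txt are hypothetical, and it assumes the Job Details were exported as plain text.

```shell
# Succeeds (exit status 0) if the Job Details text on stdin contains
# either of the nbrb status messages quoted in this article.
has_failover_symptom() {
    grep -E -q 'nbrb status: (RB deallocated orphaned resources|LTID reset media server resources)'
}
```

Example use: has_failover_symptom < job_details.txt && echo "possible match for Bug 1250443"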

Workaround:
Stop and restart bprd on the master server to clear the resource allocation issue that prevents new restores from starting.

1. Stop the bprd process.  This can be accomplished from the Daemons tab in the GUI, or via command line:
# /usr/openv/netbackup/bin/admincmd/bprdreq -terminate

2. Ensure there are no lingering bprd processes running.  If there are, kill them.

3. Restart bprd, either from the Daemons tab in the GUI, or via command line:
# /usr/openv/netbackup/bin/bprd
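
The three steps above can be combined into one script. This is a minimal sketch, assuming a UNIX master server with a POSIX ps and awk. The two command paths are the ones given in the steps; the function names, the --run argument, and the 30-second grace period are illustrative choices and not part of NetBackup.

```shell
#!/bin/sh
# Sketch of the bprd stop / check / restart workaround. Run as root on the master server.
# The two paths come from the steps above; override them if the install differs.
BPRDREQ="${BPRDREQ:-/usr/openv/netbackup/bin/admincmd/bprdreq}"
BPRD="${BPRD:-/usr/openv/netbackup/bin/bprd}"
WAIT_SECS="${WAIT_SECS:-30}"   # assumed grace period, not taken from the article

# Print the PID of every running bprd process, one per line.
bprd_pids() {
    ps -e -o pid= -o comm= | awk '$2 == "bprd" || $2 ~ /\/bprd$/ { print $1 }'
}

restart_bprd() {
    # Step 1: ask bprd to terminate.
    "$BPRDREQ" -terminate || echo "warning: bprdreq -terminate failed" >&2

    # Step 2: wait up to WAIT_SECS for bprd to exit, then kill any leftovers.
    waited=0
    while [ -n "$(bprd_pids)" ] && [ "$waited" -lt "$WAIT_SECS" ]; do
        sleep 1
        waited=$((waited + 1))
    done
    for pid in $(bprd_pids); do
        echo "killing lingering bprd process $pid" >&2
        kill "$pid" || true
    done

    # Step 3: start bprd again.
    "$BPRD"
}

# Only act when explicitly asked, so the file can be sourced safely.
if [ "${1:-}" = "--run" ]; then
    restart_bprd
fi
```

Saved under a name of your choosing and invoked with --run, the script performs all three steps. Without the argument it only defines the functions, which allows bprd_pids to be used on its own to check for lingering processes.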

ETA of Fix:
Symantec has acknowledged that the above-mentioned issue (ETrack 1250443) is present in the current version(s) of the product(s) mentioned at the end of this article.  Symantec is committed to product quality and satisfied customers.  This issue was scheduled to be addressed in a future release update.  When it is released, please visit the following link for download and readme information:
 http://www.symantec.com/enterprise/support/overview.jsp?pid=15143


Supplemental Materials

Source: ETrack
Value: 1250443
Description: once doing failover during restore, restore cannot be done. (backup/restore is being done on tape library)


Legacy ID



303970



