LanMan Resource fails unexpectedly (Error V-16-2-13067)
Hi all,
I am currently installing a 3 node Netbackup 7.1 cluster, on a VSF HA 5.1 SP2 on Windows 2008 R2 (SP1).
The installation went fine and failover of the Netbackup resource back and forth shows no problems whatsoever.
This weekend however, for no apparent reason, the LanMan Resource failed on the first node and the entire Netbackup resource was failed over to the next available node.
In the Eventlog I see:
Agent is calling clean for resource(NetBackup_Server-Lanman) because the resource became OFFLINE unexpectedly, on its own.
and in the VCS Log:
May 22, 2011 5:51:21 AM V-16-2-13067 (SRV0401) Agent is calling clean for resource(NetBackup_Server-Lanman) because the resource became OFFLINE unexpectedly, on its own. V-16-2-13067
(SRV0401) Agent is calling clean for resource(NetBackup_Server-Lanman) because the resource became OFFLINE unexpectedly, on its own.
There were no jobs running (Not configured yet) nor was anybody working on the machine when it happened.
After the failover, the first node was left in a FAILED state...
All Systems are running:
Windows 2008 R2 Enterprise, SP1, Fully patched, x64
VSF HA Windows 5.1 SP2, x64
Netbackup 7.1, x64
I've included some logs with this post. I hope someone can help with this problem!
Thanks in advance for any input you might have...
Fred
Comments 2 Comments • Jump to latest comment
The logs provided do not provide sufficient details to determined why the Lanman agent faulted on its own.
This error will be displayed for any resource under cluster control/monitor cycle. If VCS is not able to monitor the resouce as online. http://www.symantec.com/docs/TECH70812
You can try to increased logging for the lanman agent to see if we are able to log any specific errors, but you will have to wait until the issue reproduces.
TECHnote to increase logging. http://www.symantec.com/docs/TECH67017
If the agent comes online and stays online for undetermined amount of time. you may want to start looking at connectivity issues with AD.
If its a timing issue you may consider increasing the "RestartLimit" for the agent" as per http://www.symantec.com/docs/TECH54737 this may add a little more tolerance.
In most cases we see VCS is just reacting and its a result of issues with AD.
Hi Ireyes,
Thank you for the info you provided! I will indeed increase logging as you have suggested, hoping that if it occurs again, I am able to provide more info...
I've also opened a case with Symantec ( Case 414-760-213 )
They suggested the following (After receiving VxExplorer Logging). Maybe it can help others reading this as well:
Fred
Would you like to reply?
Login or Register to post your comment.