Video Screencast Help

restore one node image of sql cluster

Created: 02 Jul 2013 • Updated: 02 Jul 2013 | 9 comments

hi everyone

 

i have taken an image of the active node in an sql cluster using BESR 2010 from a vmware enviroment and deployed it on another machine in our lab. the windows boots fine and in failover cluster manager i can bring all disks and network ip online but when i try to bring the name of any service online it failes and i get this event in event viewer

event 1146

The cluster resource host subsystem (RHS) stopped unexpectedly. An attempt will be made to restart it. This is usually due to a problem in a resource DLL. Please determine which resource DLL is causing the issue and report the problem to the resource vendor.

 

i have applied all the patches that microsoft advise but no luck.

any ideas how to bring the services up on this node after restoring the image

 

kind regards

Operating Systems:

Comments 9 CommentsJump to latest comment

Markus Koestler's picture

Which OS does the cluster run on ? 2003 or 2008 R2 or 2008 ?

*** Please mark thread as solved if you consider this to have answered your question(s) ***

Markus Koestler's picture

Have you applied this hotfix ? http://support.microsoft.com/kb/978527/en-us

*** Please mark thread as solved if you consider this to have answered your question(s) ***

Afahmy's picture

yes i did

 

this is the last entries in cluster.log

000005cc.00000bec::2013/07/02-08:33:38.384 INFO  [ClNet] Adapter isatap.{D12A5AC0-23B9-4D16-B6F2-D11F8EFC6DBF} RFC2863 operational status = 2.
000005cc.00000bec::2013/07/02-08:33:38.384 DBG   [ClNet] Created adapter: DeviceGuid:     5A1FDAE8-6770-47C6-97D3-B438F9B26728
000005cc.00000bec::2013/07/02-08:33:38.384 DBG   [ClNet]                  DeviceName:     Microsoft ISATAP Adapter
000005cc.00000bec::2013/07/02-08:33:38.384 DBG   [ClNet]                  ConnectoidName: isatap.{D12A5AC0-23B9-4D16-B6F2-D11F8EFC6DBF}
000005cc.00000bec::2013/07/02-08:33:38.384 DBG   [ClNet]                  Netbios/TCP:    0
000005cc.00000bec::2013/07/02-08:33:38.384 DBG   [ClNet]                  DNS Suffix:
000005cc.00000bec::2013/07/02-08:33:38.384 ERR   [ClNet] No valid interfaces for adapter 5A1FDAE8-6770-47C6-97D3-B438F9B26728.
000005cc.00000bec::2013/07/02-08:33:38.384 ERR   [ClNet] ClRtlpCreateAdapter 3 failed, status 87
000005cc.00000bec::2013/07/02-08:33:38.384 INFO  [ClNet] Adapter isatap.{3F7EE7B5-47DC-4154-B24D-2A1761ECB231} RFC2863 operational status = 2.
000005cc.00000bec::2013/07/02-08:33:38.384 DBG   [ClNet] Created adapter: DeviceGuid:     B9111867-D43B-49A2-B0C8-E09AA8C146F3
000005cc.00000bec::2013/07/02-08:33:38.384 DBG   [ClNet]                  DeviceName:     Microsoft ISATAP Adapter #2
000005cc.00000bec::2013/07/02-08:33:38.384 DBG   [ClNet]                  ConnectoidName: isatap.{3F7EE7B5-47DC-4154-B24D-2A1761ECB231}
000005cc.00000bec::2013/07/02-08:33:38.384 DBG   [ClNet]                  Netbios/TCP:    0
000005cc.00000bec::2013/07/02-08:33:38.384 DBG   [ClNet]                  DNS Suffix:
000005cc.00000bec::2013/07/02-08:33:38.384 DBG   [ClNet]                  DnsServer:      192.168.4.244
000005cc.00000bec::2013/07/02-08:33:38.384 INFO  [ClNet] Adapter isatap.{3F9080BE-0A04-40C7-9EA6-6E55FF2E1F21} RFC2863 operational status = 2.
000005cc.00000bec::2013/07/02-08:33:38.384 DBG   [ClNet] Created adapter: DeviceGuid:     F2AB0919-D48F-4CBE-929D-E1EE6DEB9368
000005cc.00000bec::2013/07/02-08:33:38.384 DBG   [ClNet]                  DeviceName:     Microsoft ISATAP Adapter #3
000005cc.00000bec::2013/07/02-08:33:38.384 DBG   [ClNet]                  ConnectoidName: isatap.{3F9080BE-0A04-40C7-9EA6-6E55FF2E1F21}
000005cc.00000bec::2013/07/02-08:33:38.384 DBG   [ClNet]                  Netbios/TCP:    0
000005cc.00000bec::2013/07/02-08:33:38.384 DBG   [ClNet]                  DNS Suffix:
000005cc.00000bec::2013/07/02-08:33:38.384 DBG   [ClNet]                  DnsServer:      fec0:0:0:ffff::1%1
000005cc.00000bec::2013/07/02-08:33:38.384 DBG   [ClNet]                  DnsServer:      fec0:0:0:ffff::2%1
000005cc.00000bec::2013/07/02-08:33:38.384 DBG   [ClNet]                  DnsServer:      fec0:0:0:ffff::3%1
000005cc.00000bec::2013/07/02-08:33:38.384 INFO  [RES] Network Name <Cluster Name>: adapter Bank Conn (1)
000005cc.00000bec::2013/07/02-08:33:38.399 WARN  [RES] Network Name <Cluster Name>: Trying to remove credentials for LocalSystem returned status C0000225, STATUS_NOT_FOUND is a non-critical failure for a remove operation
000005cc.00000bec::2013/07/02-08:33:38.431 INFO  [RES] Network Name <Cluster Name>: Initiating the Network Name operation : 'Verifying computer object associated with network name resource SQLServer'
000005cc.00000bec::2013/07/02-08:33:38.431 INFO  [RES] Network Name <Cluster Name>: Trying to find computer account SQLSERVER object GUID(29b2497489bf4d4b851a1095db6f136c) on any available domain controller.
000005cc.00000bec::2013/07/02-08:33:38.727 INFO  [RES] Network Name <Cluster Name>: Found computer account SQLSERVER on domain controller \\AD1.afrexim.com.
000005cc.00000bec::2013/07/02-08:33:38.727 INFO  [RES] Network Name <Cluster Name>: Trying to obtain the VSToken for Core Cluster Name resource
000005cc.00000bec::2013/07/02-08:33:38.836 INFO  [RES] Network Name <Cluster Name>: Able to logon with secondary password (Version: 1 IsProposed: 0).
000005cc.00000bec::2013/07/02-08:33:38.836 INFO  [RES] Network Name <Cluster Name>: Starting credentials update locally.
00000680.00000c10::2013/07/02-08:33:38.961 ERR   [RCM] rcm::RcmMonitor::RecoverProcess: Recovering monitor process 1484 / 0x5cc
00000680.00000c10::2013/07/02-08:33:38.961 INFO  [RCM] Created monitor process 1676 / 0x68c
0000068c.000002e8::2013/07/02-08:33:38.977 INFO  [RHS] Initializing.
00000680.00000c10::2013/07/02-08:33:38.977 INFO  [RCM] rcm::RcmResource::ReattachToMonitorProcess: (Cluster Name, OnlinePending)
00000680.00000c10::2013/07/02-08:33:38.977 INFO  [RCM] TransitionToState(Cluster Name) OnlinePending-->ProcessingFailure.
00000680.00000c10::2013/07/02-08:33:38.977 INFO  [RCM] rcm::RcmGroup::UpdateStateIfChanged: (Cluster Group, Pending --> Failed)
00000680.00000c10::2013/07/02-08:33:38.977 ERR   [RCM] rcm::RcmResource::HandleFailure: (Cluster Name)
00000680.00000c10::2013/07/02-08:33:38.977 INFO  [RCM] resource Cluster Name: failure count: 3, restartAction: 2.
00000680.00000c10::2013/07/02-08:33:38.977 INFO  [RCM] TransitionToState(Cluster Name) ProcessingFailure-->[WaitingToTerminate to Failed].
00000680.00000c10::2013/07/02-08:33:38.977 INFO  [RCM] rcm::RcmGroup::UpdateStateIfChanged: (Cluster Group, Failed --> Pending)
00000680.00000c10::2013/07/02-08:33:38.977 INFO  [RCM] TransitionToState(Cluster Name) [WaitingToTerminate to Failed]-->[Terminating to Failed].
0000068c.00000b54::2013/07/02-08:33:38.977 INFO  [RHS] Waiting for Open call for Cluster Name to complete.
00000680.00000c10::2013/07/02-08:33:38.977 INFO  [RCM] Resource Cluster Name is causing group Cluster Group to failover.  Posting worker thread.
00000680.00000fd8::2013/07/02-08:33:38.977 INFO  [RCM] rcm::RcmGroup::Failover: (Cluster Group)
0000068c.00000834::2013/07/02-08:33:38.992 INFO  [RES] Network Name <Cluster Name>: NetNameOpen Invoked
0000068c.00000834::2013/07/02-08:33:38.992 INFO  [RES] Network Name <Cluster Name>: Successful open of resid 1314912
0000068c.00000b54::2013/07/02-08:33:38.992 INFO  [RES] Network Name <Cluster Name>: Terminating resource...
00000680.00000c64::2013/07/02-08:33:38.992 INFO  [RCM] HandleMonitorReply: OPENRESOURCE for 'Cluster Name', gen(40) result 0.
00000680.00000a10::2013/07/02-08:33:38.992 INFO  [RCM] HandleMonitorReply: TERMINATERESOURCE for 'Cluster Name', gen(41) result 0.
00000680.00000a10::2013/07/02-08:33:38.992 INFO  [RCM] TransitionToState(Cluster Name) [Terminating to Failed]-->Failed.
00000680.00000a10::2013/07/02-08:33:38.992 INFO  [RCM] rcm::RcmGroup::UpdateStateIfChanged: (Cluster Group, Pending --> Failed)
00000680.00000fd8::2013/07/02-08:33:39.008 WARN  [RCM] Not failing over group Cluster Group, failoverCount 1, failover threshold 4294967295, nodeAvailCount 0.
00000680.00000fd8::2013/07/02-08:33:39.008 INFO  [RCM] Will retry online of Cluster Name in 3600000 milliseconds.
00000680.00000758::2013/07/02-08:33:39.398 INFO  [ACCEPT] 0.0.0.0:~3343~: Accepted inbound connection from remote endpoint 10.1.1.12:~52281~.
00000680.00000fd8::2013/07/02-08:33:39.398 INFO  [SV] Securing route from (10.1.1.15:~3343~) to remote  (10.1.1.12:~52281~).
00000680.00000fd8::2013/07/02-08:33:39.398 INFO  [SV] Got a new incoming stream from 10.1.1.12:~52281~
00000680.00000fd8::2013/07/02-08:33:39.398 ERR       000007fe:fdbbaa7d( ERROR_MOD_NOT_FOUND(126) )
00000680.00000fd8::2013/07/02-08:33:39.398 ERR       00000000:0288f3d0( ERROR_MOD_NOT_FOUND(126) )
00000680.00000fd8::2013/07/02-08:33:39.398 ERR       00000000:00cdf260( ERROR_MOD_NOT_FOUND(126) )
00000680.00000fd8::2013/07/02-08:33:39.398 ERR       00000001:000aa178( ERROR_MOD_NOT_FOUND(126) )
00000680.00000fd8::2013/07/02-08:33:39.398 WARN  mscs::ListenerWorker::operator (): HrError(0x8009030c)' because of '[SV] Authentication or Authorization Failed'

Markus Koestler's picture

Hm, I really should think you'd consult Microsoft with this.

*** Please mark thread as solved if you consider this to have answered your question(s) ***

Afahmy's picture

ok does taking an image of a server and deploy it using restore anyware could damage the failover cluster virtual adapter?

Markus Koestler's picture

I dont know to be honest.

*** Please mark thread as solved if you consider this to have answered your question(s) ***

Markus Koestler's picture

Have you been able to resolve this issue in the meantime ?

*** Please mark thread as solved if you consider this to have answered your question(s) ***

Afahmy's picture

No but i have installed non clustered sql on another server and attached the databases