IP address change for Master server in NetBackup 6.5.3
I need to do a DR test that involves a replica of our master server at a remote location. We use Doubletake to keep the replica in sync with the production master. In order to test, we d/c the NIC on the production server and failover to the replica which becomes the live master with a new IP address.
After a flushdns \ registerdns on the media and master servers, everything *seems* ok (the NBU Admin Console sees all of the servers, tape libraries, policies, etc). But when attempting to do a test restore to a client, it works up to a point and then just hangs. The restore job will start, identify the media needed for the restore, says that it 'connects' and then just sits there. The master server is pingable by name and from what I've seen in the log files so far, communication to the master server through the new IP address looks good.
If I do a similar test locally and create the replica here, it will work, but only if I use the exact same IP address. If I change the IP address at all, I see the same results as above after flushing & reregistering with DNS.
I'm leaning toward this being some sort of DNS issue, but maybe there's something more I'm missing in NBU?
Comments
IP Change
Does the job eventually fail? What is the status code? What kind of errors are you seeing in the bpbrm log of the media server, the bpcd log of both the media server and the client? Also, what happens when you use:
bpclntcmd -pn
bpclntcmd -hn
bpclntcmd -ip
on the client, media server and master?
Regards,
Benjamin Schmaus
NetBackup does not care about
NetBackup does not care about IP address. As long as hostname can be resolved to IP address and backwards (IP to hostname) at TCP/IP level, everything should work.
Use bpclntcmd to test comms in ALL directions as suggested above.
U can also use 'bptestbpcd -client <client_name>' to test connection to client as well as connect back.
If you have media servers, do the same test to clients.
Create bpcd log on client to troubleshoot comms problems.
Supporting Storage Foundation and VCS on Unix and Windows as well as NetBackup on Unix and Windows.
Handy NBU links
DNS
Try to use to use /etc/hosts as much as possible disable DNS. If you DNS master server does do name lookup and try to learn about the machines. you have to be sure that reserve name lookup everything is working fine.
alternative solution
Many NetBackup admins use local hosts file for NetBackup environments especially when private networks are being used.
Bob Stump VERITAS - "Ain't it the truth?" Incorrigible punster -- Do not incorrige
Did you flush DNS on client too?
Connecting/Connected message usually means connection to client, not to tthe media server.
You mentioned flush dns on master and media, but you did not mention anything about flushing DNS on client.
When it comes to restore, the NetBakcup Client will need to reverse-lookup the IP address of the replica master server and then verify it against the SERVER entries in registry/bp.conf.
In other words, if you only flush DNS on the master and media, not on the client, then the client will get some odd name from the reverse-lookup of the replica master's IP address, and it will then reject the restore connection from the replica master, thinking that "oh, this is not my master server, because the name I found from DNS does not match the master name in my configuration"
Well, if flushing DNS on the client does not resolve the issue, then I think as a last resort you might want to update the hosts file on all three - master and media and client.
After you update all 3 system's hosts files, you know the name resolution should definitely work (well, test pings by name, to be sure) and then see if the restore works.
This will at least confirm if the restore failure/hang is because of the changed IP address or something else (different firewall settings in replica DR network, or something like that)
Good luck,
Abe
Anger Management
Thanks for the tips folks,
Thanks for the tips folks, I'll give these a try the next time I'm able to failover to the test DR server (hopefully by tommorow).
Abesama, I thought I flushed DNS on the client machine as well, but I'm not positive I did there. I was trying to restore the client files to the media server, so I was thinking that might take the client out of the loop. I'll test that out as well.
Would you like to reply?
Login or Register to post your comment.