
Backup fails when backing up "/"

Created: 19 Feb 2013 • Updated: 19 Feb 2013 | 3 comments
This issue has been solved. See solution.

Dear all,

NBU version: 7.5.0.3, on CentOS release 5.7 (Final).

The NBU master is at one site (Vegas), and the newly added media server is at another site (New York), so it is a remote media server.

We write the backups to an EMC Data Domain. The Data Domain is in New York, at the same site as the remote media server.

After the configuration was done, I set up a backup policy to test the backups.

Here is the issue:

When I set the backup selection to one or more small paths, for example /etc, /home, /boot, the backups work fine and finish successfully.

But when I add the root volume "/" to the backup selection, the backup fails. I am not sure how to troubleshoot this.

The failed backups exit with status code 13. Here are the details:

===========================================================

2013-2-19 16:44:28 - Info bpbrm (pid=2987) lx0030nbumed01.active.local is the host to backup data from
2013-2-19 16:44:28 - Info bpbrm (pid=2987) reading file list from client
2013-2-19 16:44:29 - Info bpbrm (pid=2987) starting bpbkar on client
2013-2-19 16:44:29 - Info bpbkar (pid=3032) Backup started
2013-2-19 16:44:29 - Info bpbrm (pid=2987) bptm pid: 3033
2013-2-19 16:44:29 - Info bptm (pid=3033) start
2013-2-19 16:44:31 - Info bptm (pid=3033) using 262144 data buffer size
2013-2-19 16:44:31 - Info bptm (pid=3033) using 30 data buffers
2013-2-19 16:44:32 - Info bpbrm (pid=2987) from client lx0030nbumed01.active.local: TRV - [/selinux] is in a different file system from [/]. Skipping
2013-2-19 16:44:32 - Info bptm (pid=3033) start backup
2013-2-19 16:44:32 - Info bpbrm (pid=2987) from client lx0030nbumed01.active.local: TRV - [/misc] is in a different file system from [/]. Skipping
2013-2-19 16:44:33 - Info bpbrm (pid=2987) from client lx0030nbumed01.active.local: TRV - [/dd530-us0030/lx0030nbumed01-102] is on file system type NFS. Skipping
2013-2-19 16:44:33 - Info bpbrm (pid=2987) from client lx0030nbumed01.active.local: TRV - [/dd530-us0030/backup] is on file system type NFS. Skipping
2013-2-19 16:44:33 - Info bpbrm (pid=2987) from client lx0030nbumed01.active.local: TRV - [/dd530-us0030/ddvar] is on file system type NFS. Skipping
2013-2-19 16:44:34 - Info bpbrm (pid=2987) from client lx0030nbumed01.active.local: TRV - [/tmp/.font-unix/fs7100] is a socket special file. Skipping
2013-2-19 16:44:35 - Info bpbrm (pid=2987) from client lx0030nbumed01.active.local: TRV - [/var/run/pcscd.comm] is a socket special file. Skipping
2013-2-19 16:44:35 - Info bpbrm (pid=2987) from client lx0030nbumed01.active.local: TRV - [/var/run/sdp] is a socket special file. Skipping
2013-2-19 16:44:35 - Info bpbrm (pid=2987) from client lx0030nbumed01.active.local: TRV - [/var/run/acpid.socket] is a socket special file. Skipping
2013-2-19 16:44:36 - Info bpbrm (pid=2987) from client lx0030nbumed01.active.local: TRV - [/var/run/audispd_events] is a socket special file. Skipping
2013-2-19 16:44:36 - Info bpbrm (pid=2987) from client lx0030nbumed01.active.local: TRV - [/var/run/dbus/system_bus_socket] is a socket special file. Skipping
2013-2-19 16:44:36 - Info bpbrm (pid=2987) from client lx0030nbumed01.active.local: TRV - [/var/run/setrans/.setrans-unix] is a socket special file. Skipping
2013-2-19 16:44:37 - Info bpbrm (pid=2987) from client lx0030nbumed01.active.local: TRV - [/var/lib/nfs/rpc_pipefs] is in a different file system from [/]. Skipping
2013-2-19 16:44:39 - Info bpbkar (pid=3032) 4999 entries sent to bpdbm
2013-2-19 16:44:41 - Info bpbkar (pid=3032) 9999 entries sent to bpdbm
2013-2-19 16:44:44 - Info bpbkar (pid=3032) 14999 entries sent to bpdbm
2013-2-19 16:44:44 - Info bpbrm (pid=2987) from client lx0030nbumed01.active.local: TRV - [/usr/openv/var/vnetd/bpcompatd.uds] is a socket special file. Skipping
2013-2-19 16:44:45 - Info bpbrm (pid=2987) from client lx0030nbumed01.active.local: TRV - [/usr/openv/var/vnetd/terminate_bpcd.uds] is a socket special file. Skipping
2013-2-19 16:44:45 - Info bpbrm (pid=2987) from client lx0030nbumed01.active.local: TRV - [/usr/openv/var/vnetd/bpcd.uds] is a socket special file. Skipping
2013-2-19 16:44:45 - Info bpbrm (pid=2987) from client lx0030nbumed01.active.local: TRV - [/usr/openv/var/vnetd/vmd.uds] is a socket special file. Skipping
2013-2-19 16:44:46 - Info bpbrm (pid=2987) from client lx0030nbumed01.active.local: TRV - [/usr/openv/var/vnetd/terminate_vnetd.uds] is a socket special file. Skipping
2013-2-19 16:45:01 - Info nbjm (pid=28639) starting backup job (jobid=1220957) for client lx0030nbumed01.active.local, policy 0030_lx0030nbumed01, schedule Bi_weekly_full
2013-2-19 16:45:01 - Info nbjm (pid=28639) requesting STANDARD_RESOURCE resources from RB for backup job (jobid=1220957, request id:{C272D78E-7AF6-11E2-8DB2-DB8531524DAB})
2013-2-19 16:45:01 - requesting resource lx0030nbumed01-102_multi_dd530_rdsu01
2013-2-19 16:45:01 - requesting resource lx0034nbumast.NBU_CLIENT.MAXJOBS.lx0030nbumed01.active.local
2013-2-19 16:45:01 - requesting resource lx0034nbumast.NBU_POLICY.MAXJOBS.0030_lx0030nbumed01
2013-2-19 16:45:01 - granted resource  lx0034nbumast.NBU_CLIENT.MAXJOBS.lx0030nbumed01.active.local
2013-2-19 16:45:01 - granted resource  lx0034nbumast.NBU_POLICY.MAXJOBS.0030_lx0030nbumed01
2013-2-19 16:45:01 - granted resource  MediaID=@aaaaA;Path=/dd530-us0030/lx0030nbumed01-102;MediaServer=lx0030nbumed01.active.local
2013-2-19 16:45:01 - granted resource  lx0030nbumed01-102_multi_dd530_rdsu01
2013-2-19 16:45:02 - estimated 68697 kbytes needed
2013-2-19 16:45:02 - Info nbjm (pid=28639) started backup (backupid=lx0030nbumed01.active.local_1361321101) job for client lx0030nbumed01.active.local, policy 0030_lx0030nbumed01, schedule Bi_weekly_full on storage unit lx0030nbumed01-102_multi_dd530_rdsu01
2013-2-19 16:45:04 - started process bpbrm (pid=2987)
2013-2-19 16:45:05 - connecting
2013-2-19 16:45:05 - connected; connect time: 0:00:00
2013-2-19 16:45:09 - begin writing
2013-2-19 16:45:47 - Info bpbkar (pid=3032) 19999 entries sent to bpdbm
2013-2-19 16:46:06 - Info bpbkar (pid=3032) 24999 entries sent to bpdbm
2013-2-19 16:46:13 - Info bpbkar (pid=3032) 29999 entries sent to bpdbm
2013-2-19 16:46:22 - Info bpbkar (pid=3032) 34999 entries sent to bpdbm
2013-2-19 16:46:25 - Info bpbkar (pid=3032) 39999 entries sent to bpdbm
2013-2-19 16:46:30 - Info bpbkar (pid=3032) 44999 entries sent to bpdbm
2013-2-19 16:54:42 - Error bpbrm (pid=2987) socket read failed: errno = 62 - Timer expired
2013-2-19 16:59:43 - Error bptm (pid=3033) media manager terminated by parent process
2013-2-19 16:59:53 - Info bpbkar (pid=3032) done. status: 13: file read failed
2013-2-19 17:00:29 - end writing; write time: 0:15:20
file read failed  (13)
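
One way to read a job log like the one above is to sort the entries by timestamp (the job details are not listed in strict time order) and measure how long the job was silent before the first Error line. The following is a minimal Python sketch, not a NetBackup tool: the function names `parse_job_log` and `silence_before_first_error` are made up for illustration, and the sample lines are copied from the log above.

```python
import re
from datetime import datetime

# Matches lines such as "2013-2-19 16:46:30 - Info bpbkar (pid=3032) ..."
LINE_RE = re.compile(r"^(\d{4}-\d{1,2}-\d{1,2} \d{2}:\d{2}:\d{2}) - (.*)$")


def parse_job_log(lines):
    """Return (timestamp, message) tuples for every line that looks like a job log entry."""
    events = []
    for line in lines:
        match = LINE_RE.match(line.strip())
        if match:
            stamp = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S")
            events.append((stamp, match.group(2)))
    return events


def silence_before_first_error(events):
    """Return (last_activity, error_time, gap_seconds) for the first Error entry, or None."""
    previous = None
    for stamp, message in sorted(events, key=lambda e: e[0]):
        if message.startswith("Error"):
            if previous is None:
                return None
            return previous, stamp, (stamp - previous).total_seconds()
        previous = stamp
    return None


# Sample lines copied from the job details above.
sample = [
    "2013-2-19 16:45:09 - begin writing",
    "2013-2-19 16:46:30 - Info bpbkar (pid=3032) 44999 entries sent to bpdbm",
    "2013-2-19 16:54:42 - Error bpbrm (pid=2987) socket read failed: errno = 62 - Timer expired",
    "2013-2-19 16:59:43 - Error bptm (pid=3033) media manager terminated by parent process",
]

result = silence_before_first_error(parse_job_log(sample))
print(result[2])  # 492.0 seconds between the last progress entry and the first error
```

A gap of several minutes with no progress entries, followed by "Timer expired", is consistent with a read timeout on the media server side rather than an immediate file access error, which is why the timeout setting is a reasonable first thing to check.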
 


RamNagalla

Hi,

What is the client read timeout value on the media server?

Increase it to 1200 or 1800 and see if that helps.

If not, set VERBOSE = 5 on the client and send us the bpbkar log from the client.

SOLUTION
Ivy_Yang

Thank you, Nagalla.

I have increased it to 3600 now and will try again!

Ivy_Yang

Thank you, Nagalla.

I increased it to 3600 and tried again, and it seems to work. Hahaha.
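
For anyone landing here later: the thread does not say how the timeout was changed. On Linux hosts, one common place for it is the `CLIENT_READ_TIMEOUT` entry in `/usr/openv/netbackup/bp.conf` on the media server, and that is the assumption behind the sketch below. The helper `set_bp_conf_entry` is a made-up name, and the sample file content is illustrative (only the host names are taken from the log). It works on text only, so nothing on a real server is touched; back up the real file before editing it.

```python
import re


def set_bp_conf_entry(text, key, value):
    """Return bp.conf-style text with `key = value` set.

    An existing entry for `key` is replaced; otherwise the entry is appended.
    """
    pattern = re.compile(rf"^[ \t]*{re.escape(key)}[ \t]*=.*$", re.MULTILINE)
    new_line = f"{key} = {value}"
    if pattern.search(text):
        # A function is used as the replacement so the value is inserted literally.
        return pattern.sub(lambda _match: new_line, text)
    if text and not text.endswith("\n"):
        text += "\n"
    return text + new_line + "\n"


# Illustrative bp.conf content; the starting timeout value is an assumption.
sample_conf = (
    "SERVER = lx0034nbumast\n"
    "CLIENT_NAME = lx0030nbumed01.active.local\n"
    "CLIENT_READ_TIMEOUT = 300\n"
)

# Raise the read timeout to the value that worked in this thread.
updated_conf = set_bp_conf_entry(sample_conf, "CLIENT_READ_TIMEOUT", "3600")
# Optional: raise client logging, as suggested if the timeout change does not help.
updated_conf = set_bp_conf_entry(updated_conf, "VERBOSE", "5")
print(updated_conf)
```

If the timeout change had not helped, the next step suggested above was the bpbkar log with VERBOSE = 5. That log is only written if the `bpbkar` directory exists under `/usr/openv/netbackup/logs` on the client.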