Vidéos d'aide de Screencast

Socket Fail Issue

Created: 25 Janv. 2013 • Updated: 08 Fév. 2013 | 14 comments
l'image des mansoor.sheik
Ce problème a été résolu. Voir la solution.

Hi All,

Backup failed with socket failed issue (24). From detailed status found that it has intiaited the begin writing and failed with Connection error.

No data has been written to tape.

01/25/2013 14:32:28 - begin writing
01/25/2013 16:41:25 - Error bpbrm (pid=11293) from client Hostname: ERR - Cannot write to STDOUT. Errno = 104: Connection reset by peer

Netbackup Master Server:

Unix + 7.1 Version

Netbackup CLient

Linux + 6.5 Version

Commentaires CommentairesAccéder au dernier commentaire

l'image des RamNagalla

hi,

does it a standard or Data base backup?

Does it a single stream or Multi stream backup?

does it writing any data before failing jobs, or its failing without wirting any data?

what is the client read timeout in Media server?

l'image des mansoor.sheik

Hi Nagalla,

Thanks for your reply.

does it a standard or Data base backup?

Standard

Does it a single stream or Multi stream backup?

Single Stream

does it writing any data before failing jobs, or its failing without wirting any data?

No

what is the client read timeout in Media server?

7200

l'image des RamNagalla

Please post the failed job detail status and the bpcd log form the client with Verbose = 5 

and also try with Multistream backup to isolate the issue, and see if its able to make the streams, and if yes, all the streams are failing or any specific stream.. to find if that is specific file system or because of the network.

l'image des mansoor.sheik

Hi Nagalla,

Thanks for your update. But we fixed, with the below TN.

http://www.symantec.com/business/support/index?page=content&id=TECH57079.

As mentioned in the TN, My backup is very slow.

SOLUTION
l'image des Marianne

Have backups ever worked for this client?
Which Linux version? Why NBU 6.5? 32-bit unsupported OS?

Single stream backup of single filesystem or multiple?

Does Job Details tab show initial successful connection?
If so, we need client's bpbkar log as well.

Supporting Storage Foundation and VCS on Unix and Windows as well as NetBackup on Unix and Windows
Handy NBU Links

l'image des mansoor.sheik

Hi Marianne,

After Long time, We have intiated a backup for a request.

It is an ESX Server. 6.5

uname -a
Linux Client _servername 2.6.18-128.ESX #1 Fri Apr 10 00:08:17 PDT 2009 x86_64 x86_64 x86_64 GNU/LinuClient

Single Stream backup of single filesystem.

Does Job Details tab show initial successful connection? Yes.

------------------------------------------------------------------------------------------
01/25/2013 12:10:11 - Info nbjm (pid=8105) requesting STANDARD_RESOURCE resources from RB for backup job (jobid=4846, request id:{11515BEC-66BA-11E2-B549-00215AF8647B})
01/25/2013 12:10:11 - requesting resource STU1
01/25/2013 12:10:11 - requesting resource MasterServer_Name.NBU_CLIENT.MAXJOBS.CLient Server Name
01/25/2013 12:10:11 - requesting resource MasterServer_Name.NBU_POLICY.MAXJOBS.SAP_application_servers_backup
01/25/2013 12:10:11 - granted resource MasterServer_Name.NBU_CLIENT.MAXJOBS.CLient Server Name
01/25/2013 12:10:11 - granted resource MasterServer_Name.NBU_POLICY.MAXJOBS.Policy Name
01/25/2013 12:10:11 - granted resource TD1283
01/25/2013 12:10:11 - granted resource HP.ULTRIUM4-SCSI.000
01/25/2013 12:10:11 - granted resource STU1
01/25/2013 12:10:11 - estimated 0 kbytes needed
01/25/2013 12:10:11 - Info nbjm (pid=8105) started backup job for client CLient Server Name, policy policy Name, schedule Schedule NAme on storage unit STU1
01/25/2013 12:10:13 - started process bpbrm (pid=3645)
01/25/2013 12:10:23 - Info bpbrm (pid=3645) CLient Server Name is the host to backup data from
01/25/2013 12:10:23 - Info bpbrm (pid=3645) reading file list from client
01/25/2013 12:10:23 - connecting
01/25/2013 12:10:39 - Info bpbrm (pid=3645) starting bpbkar on client
01/25/2013 12:10:39 - Info bpbkar (pid=0) Backup started
01/25/2013 12:10:39 - Info bpbrm (pid=3645) bptm pid: 3689
01/25/2013 12:10:39 - connected; connect time: 0:00:00
01/25/2013 12:10:40 - Info bptm (pid=3689) start
01/25/2013 12:10:40 - Info bptm (pid=3689) using 65536 data buffer size
01/25/2013 12:10:40 - Info bptm (pid=3689) using 30 data buffers
01/25/2013 12:10:40 - Info bptm (pid=3689) start backup
01/25/2013 12:10:40 - Info bptm (pid=3689) backup child process is pid 3691
01/25/2013 12:10:40 - Info bptm (pid=3689) Waiting for mount of media id TD1283 (copy 1) on server MasterServer_Name.
01/25/2013 12:10:40 - mounting TD1283
01/25/2013 12:11:43 - Info bptm (pid=3689) media id TD1283 mounted on drive index 0, drivepath /dev/rtape/tape8_BESTnb, drivename HP.ULTRIUM4-SCSI.000, copy 1
01/25/2013 12:11:43 - mounted TD1283; mount time: 0:01:03
01/25/2013 12:11:43 - positioning TD1283 to file 5
01/25/2013 12:13:19 - positioned TD1283; position time: 0:01:36
01/25/2013 12:13:19 - begin writing
01/25/2013 14:19:07 - Error bpbrm (pid=3645) from client CLient Server Namet: ERR - Cannot write to STDOUT. Errno = 104: Connection reset by peer
01/25/2013 14:19:09 - Error bptm (pid=3689) media manager terminated by parent process
01/25/2013 14:19:23 - Info bpbkar (pid=0) done. status: 24: socket write failed
01/25/2013 14:19:23 - end writing; write time: 2:06:04
01/25/2013 14:29:23 - Info nbjm (pid=8105) starting backup job (jobid=4846) for client CLient Server Name, policy policy Name, schedule Schedule NAme
socket write failed (24)
-------------------------------------------------------------------------------------------------------------------

l'image des RamNagalla

01/25/2013 14:19:23 - Info bpbkar (pid=0) done. status: 24: socket write failed

bpbkar is not started, 

do you still have this problem, or did fixed as saying in above post?

l'image des mansoor.sheik

Hi Nagalla,

Backup is completed.Please find the Job details screnshot.

But it took 3 attempt to complete.

Pièce jointeTaille
Socket_Issue_24.doc 126.5 KO
l'image des RamNagalla

its just took around 36 GB, does it matching your data size in your File system?

Do you have decicated backup network or only single network for both backup and Production?

if you have decicated network could you make sure that the backup is using Dedicated network for data transport?

please provide the bpcd , bpkbar, bpbrm and bptm logs 

l'image des mansoor.sheik

Hi,

My backsize is 36 Gb. We dont have dedicated network for backup environment.
I have uploaded the logs. Can u guide me how to get bpkbar logs.

Pièce jointeTaille
Netbackup_Logs.zip 1.42 Mo
l'image des Mark_Solutions

It is bpbkar, not bpkbar - just need a directory under logs to create its log files.

Authorised Symantec Consultant

Don't forget to "Mark as Solution" if someones advice has solved your issue - and please bring back the Thumbs Up!!.

l'image des mansoor.sheik

Hi Mark_Solutions,

Thanks....

Hi Nagella,

I have taken a backup to decommission the server.
After backup completion i intimated to server team, and they have removed the server(ESX).

l'image des RamNagalla

opps, that was a typo... its bpbkar from Client.

as they removed the server , i guess you are done with your task.. ;-)

l'image des mansoor.sheik

Hi Nagalla/All,

My backup is completed.
Thanks for your support.