Video Screencast Help
Symantec to Separate Into Two Focused, Industry-Leading Technology Companies. Learn more.

Socket Fail Issue

Created: 25 Jan 2013 • Updated: 08 Feb 2013 | 14 comments
mansoor.sheik's picture
This issue has been solved. See solution.

Hi All,

Backup failed with socket failed issue (24). From detailed status found that it has intiaited the begin writing and failed with Connection error.

No data has been written to tape.

01/25/2013 14:32:28 - begin writing
01/25/2013 16:41:25 - Error bpbrm (pid=11293) from client Hostname: ERR - Cannot write to STDOUT. Errno = 104: Connection reset by peer

Netbackup Master Server:

Unix + 7.1 Version

Netbackup CLient

Linux + 6.5 Version

Comments 14 CommentsJump to latest comment

RamNagalla's picture

hi,

does it a standard or Data base backup?

Does it a single stream or Multi stream backup?

does it writing any data before failing jobs, or its failing without wirting any data?

what is the client read timeout in Media server?

mansoor.sheik's picture

Hi Nagalla,

Thanks for your reply.

does it a standard or Data base backup?

Standard

Does it a single stream or Multi stream backup?

Single Stream

does it writing any data before failing jobs, or its failing without wirting any data?

No

what is the client read timeout in Media server?

7200

RamNagalla's picture

Please post the failed job detail status and the bpcd log form the client with Verbose = 5 

and also try with Multistream backup to isolate the issue, and see if its able to make the streams, and if yes, all the streams are failing or any specific stream.. to find if that is specific file system or because of the network.

mansoor.sheik's picture

Hi Nagalla,

Thanks for your update. But we fixed, with the below TN.

http://www.symantec.com/business/support/index?page=content&id=TECH57079.

As mentioned in the TN, My backup is very slow.

SOLUTION
Marianne's picture

Have backups ever worked for this client?
Which Linux version? Why NBU 6.5? 32-bit unsupported OS?

Single stream backup of single filesystem or multiple?

Does Job Details tab show initial successful connection?
If so, we need client's bpbkar log as well.

Supporting Storage Foundation and VCS on Unix and Windows as well as NetBackup on Unix and Windows
Handy NBU Links

mansoor.sheik's picture

Hi Marianne,

After Long time, We have intiated a backup for a request.

It is an ESX Server. 6.5

uname -a
Linux Client _servername 2.6.18-128.ESX #1 Fri Apr 10 00:08:17 PDT 2009 x86_64 x86_64 x86_64 GNU/LinuClient

Single Stream backup of single filesystem.

Does Job Details tab show initial successful connection? Yes.

------------------------------------------------------------------------------------------
01/25/2013 12:10:11 - Info nbjm (pid=8105) requesting STANDARD_RESOURCE resources from RB for backup job (jobid=4846, request id:{11515BEC-66BA-11E2-B549-00215AF8647B})
01/25/2013 12:10:11 - requesting resource STU1
01/25/2013 12:10:11 - requesting resource MasterServer_Name.NBU_CLIENT.MAXJOBS.CLient Server Name
01/25/2013 12:10:11 - requesting resource MasterServer_Name.NBU_POLICY.MAXJOBS.SAP_application_servers_backup
01/25/2013 12:10:11 - granted resource MasterServer_Name.NBU_CLIENT.MAXJOBS.CLient Server Name
01/25/2013 12:10:11 - granted resource MasterServer_Name.NBU_POLICY.MAXJOBS.Policy Name
01/25/2013 12:10:11 - granted resource TD1283
01/25/2013 12:10:11 - granted resource HP.ULTRIUM4-SCSI.000
01/25/2013 12:10:11 - granted resource STU1
01/25/2013 12:10:11 - estimated 0 kbytes needed
01/25/2013 12:10:11 - Info nbjm (pid=8105) started backup job for client CLient Server Name, policy policy Name, schedule Schedule NAme on storage unit STU1
01/25/2013 12:10:13 - started process bpbrm (pid=3645)
01/25/2013 12:10:23 - Info bpbrm (pid=3645) CLient Server Name is the host to backup data from
01/25/2013 12:10:23 - Info bpbrm (pid=3645) reading file list from client
01/25/2013 12:10:23 - connecting
01/25/2013 12:10:39 - Info bpbrm (pid=3645) starting bpbkar on client
01/25/2013 12:10:39 - Info bpbkar (pid=0) Backup started
01/25/2013 12:10:39 - Info bpbrm (pid=3645) bptm pid: 3689
01/25/2013 12:10:39 - connected; connect time: 0:00:00
01/25/2013 12:10:40 - Info bptm (pid=3689) start
01/25/2013 12:10:40 - Info bptm (pid=3689) using 65536 data buffer size
01/25/2013 12:10:40 - Info bptm (pid=3689) using 30 data buffers
01/25/2013 12:10:40 - Info bptm (pid=3689) start backup
01/25/2013 12:10:40 - Info bptm (pid=3689) backup child process is pid 3691
01/25/2013 12:10:40 - Info bptm (pid=3689) Waiting for mount of media id TD1283 (copy 1) on server MasterServer_Name.
01/25/2013 12:10:40 - mounting TD1283
01/25/2013 12:11:43 - Info bptm (pid=3689) media id TD1283 mounted on drive index 0, drivepath /dev/rtape/tape8_BESTnb, drivename HP.ULTRIUM4-SCSI.000, copy 1
01/25/2013 12:11:43 - mounted TD1283; mount time: 0:01:03
01/25/2013 12:11:43 - positioning TD1283 to file 5
01/25/2013 12:13:19 - positioned TD1283; position time: 0:01:36
01/25/2013 12:13:19 - begin writing
01/25/2013 14:19:07 - Error bpbrm (pid=3645) from client CLient Server Namet: ERR - Cannot write to STDOUT. Errno = 104: Connection reset by peer
01/25/2013 14:19:09 - Error bptm (pid=3689) media manager terminated by parent process
01/25/2013 14:19:23 - Info bpbkar (pid=0) done. status: 24: socket write failed
01/25/2013 14:19:23 - end writing; write time: 2:06:04
01/25/2013 14:29:23 - Info nbjm (pid=8105) starting backup job (jobid=4846) for client CLient Server Name, policy policy Name, schedule Schedule NAme
socket write failed (24)
-------------------------------------------------------------------------------------------------------------------

RamNagalla's picture

01/25/2013 14:19:23 - Info bpbkar (pid=0) done. status: 24: socket write failed

bpbkar is not started, 

do you still have this problem, or did fixed as saying in above post?

mansoor.sheik's picture

Hi Nagalla,

Backup is completed.Please find the Job details screnshot.

But it took 3 attempt to complete.

AttachmentSize
Socket_Issue_24.doc 126.5 KB
RamNagalla's picture

its just took around 36 GB, does it matching your data size in your File system?

Do you have decicated backup network or only single network for both backup and Production?

if you have decicated network could you make sure that the backup is using Dedicated network for data transport?

please provide the bpcd , bpkbar, bpbrm and bptm logs 

mansoor.sheik's picture

Hi,

My backsize is 36 Gb. We dont have dedicated network for backup environment.
I have uploaded the logs. Can u guide me how to get bpkbar logs.

AttachmentSize
Netbackup_Logs.zip 1.42 MB
Mark_Solutions's picture

It is bpbkar, not bpkbar - just need a directory under logs to create its log files.

Authorised Symantec Consultant

Don't forget to "Mark as Solution" if someones advice has solved your issue - and please bring back the Thumbs Up!!.

mansoor.sheik's picture

Hi Mark_Solutions,

Thanks....

Hi Nagella,

I have taken a backup to decommission the server.
After backup completion i intimated to server team, and they have removed the server(ESX).

RamNagalla's picture

opps, that was a typo... its bpbkar from Client.

as they removed the server , i guess you are done with your task.. ;-)

mansoor.sheik's picture

Hi Nagalla/All,

My backup is completed.
Thanks for your support.