BPFIS process stuck (VM backup via VC)
Hi All,
The issue I am having has been going on for quite a while, we run a policy with a max of 4 jobs per policy, what happens is that when Netbackup kicks off a job it stops after it creates the bpbrm process (please see below), and in the backup host you will see a bpfis process sitting there doing nothing, now after a few days of backup when we have 4 stuck jobs and 4 bpfis process not exiting, NBU will not be able to do any backup due to the limit set on the policy, we get a status 156. Also while the process is stuck there are no activites in the VC console, so basically it doesn't even get to the part where it tries to create a snapshot. (This is not your normal 156 snapshot error, what we found is as soon as we end the stuck bpfis process we will get a successful backup)
Now the question is does anyone know or has experienced this problem where bpfis is not exiting and causing entire policy or other jobs to fail? If so do you have a solution? There is already a case opened with Symantec, but after looking at all the logs we are not really getting anywhere.
3/18/2011 4:00:00 PM - requesting resource XXXXXXXXXXXXXXX
3/18/2011 4:00:00 PM - requesting resource XXXXXXXXXXXXXXX.NBU_CLIENT.MAXJOBS.XXXXXXXXXXXX
3/18/2011 4:00:00 PM - requesting resource XXXXXXXXXXXXXXX.NBU_POLICY.MAXJOBS.XXXXXXXXXXXX
3/18/2011 4:00:00 PM - granted resource XXXXXXXXXXXXXXX.NBU_CLIENT.MAXJOBS.XXXXXXXXXXXX
3/18/2011 4:00:00 PM - granted resource XXXXXXXXXXXXXXX.NBU_POLICY.MAXJOBS.XXXXXXXXXXXX
3/18/2011 4:00:00 PM - granted resource MediaID=@aaaa5;DiskVolume=PureDiskVolume;DiskPool=XXXXXXXXXXXX;Path=PureDiskVolume;StorageServer=XXXXXXXXXXXX;MediaServer=XXXXXXXXXXXX
3/18/2011 4:00:00 PM - granted resource XXXXXXXXXXXX
3/18/2011 4:00:00 PM - estimated 351719880 Kbytes needed
3/18/2011 4:00:00 PM - begin Parent Job
3/18/2011 4:00:00 PM - begin Flash Backup Windows, Start Notify Script
3/18/2011 4:00:00 PM - started process RUNCMD (1264)
3/18/2011 4:00:00 PM - ended process 0 (1264)
Status 0
3/18/2011 4:00:00 PM - end Flash Backup Windows, Start Notify Script; elapsed time: 00:00:00
3/18/2011 4:00:00 PM - begin Flash Backup Windows, Step By Condition
Status 0
3/18/2011 4:00:00 PM - end Flash Backup Windows, Step By Condition; elapsed time: 00:00:00
3/18/2011 4:00:00 PM - begin Flash Backup Windows, Read File List
Status 0
3/18/2011 4:00:00 PM - end Flash Backup Windows, Read File List; elapsed time: 00:00:00
3/18/2011 4:00:00 PM - begin Flash Backup Windows, Create Snapshot
3/18/2011 4:00:00 PM - started
3/18/2011 4:00:01 PM - begin Create Snapshot
3/18/2011 4:00:01 PM - snapshot backup of client XXXXXXXXXXXX using method VMware
3/18/2011 4:00:01 PM - started process bpbrm (2528)
Any help would be great.
Thanks.
Comments
Environment?
NetBackup versions & operatings systems?
What's in the logs? Could you attach the relevant ones that you're looking at?
Regards Andy
"It's not too late to panic ..."
I ran into something similar
I ran into something similar last week with BPFIS. Our Exchange 2010 backups were failing and BPFIS was hung open. Restarting the clients fixed it in my case. I did a bit of digging and found that the issue was with the VSS writer not responding.
NBU 7.01 on Windows 2003 with LTO2 Library
Sorry for the long gap during
Sorry for the long gap during reply, caught up with other issues.
We are running NBU 7.1, Puredisk 6.6.1, VM media hosts are Windows server 2008 R2.
The bpfis logs are massive, I'll see if I can capture some tomorrow and snip it for you, we get few stuck everyday.
Thanks for the replies.
I had a look at the BPFIS
I had a look at the BPFIS log, but everything was ok in there, the issue is that the child job finishes and the parent job is stuck in the activity monitor, it doesn't actually generate any errors.
Any ideas?
Have you tried?
Hi there...
Can you try removing your VCenter server out of Netbackup and reenter it?
Make sure that the user credentials you provide has enough rights...
Fred
Would you like to reply?
Login or Register to post your comment.