MSDP status 2074, but volume is not down
Does anyone here have any experience yet using Cisco UCS blades as NBU media servers?
We are migrating our media servers from Oracle x86 and Solaris SPARC hardware, which connects to Data Domains over NFS, to Cisco UCS blades running Red Hat Linux with Fibre Channel-attached storage, using NBU's Media Server Deduplication Pools (MSDP). We are running NBU 220.127.116.11 on all servers, and most clients are running 18.104.22.168.
We have been seeing sporadic status 2074 errors during heavy backup loads on one MSDP. The MSDP is not actually down, though: the jobs auto-retry 1-2 hours later and succeed. I am not manually bringing the MSDP back up, and I can't find any indication that anything else is cycling the services or rebooting the server.
I can't find a storaged.log or spoold.log on the server. A little while ago I uncommented the line in /usr/openv/lib/ost-plugins/pd.conf that enables PureDisk logging and cycled NBU on that server, so I don't have any logs from that yet.
If you have worked with this combination of hardware/OS/NBU, do you have any advice on UCS tuning or settings? We're also having other issues with TCP tuning (socket reads/writes failing).