Client Management Suite

 View Only

Server Fault Tolerance for Software Delivery Solution 

Dec 19, 2007 03:52 PM

While looking for information regarding package server redundancy, I was unsuccessful in locating any references. Hopefully, this posting will fill the missing information gap.

Question:

If a package server was defined for a site and that package server goes down, how will the Altiris Agent clients respond? Original thought was that if the clients were unable to pull from the package server, the clients would fail over to the NS to get their distribution.

Testing Results:

The client tries to download the package from the package server assigned to the site (which was shut down for testing). Since the server was down, initial download was unsuccessful but then client tried to connect again in 3 minutes, then 6 minutes. The check-in time kept doubling until the client checked every 2 hours. From that point on, the client continued to check every 2 hours until the package server came back up.

Solution:

Fault tolerance was achieved by defining 2 package servers for a site. In this scenario the client attempted to contact the initial package server but since the server was down the client then switched to the second package server that was defined.

This discovery led to another question: Would the client point back at the original package server once it was brought up?

The clients that failed over to the new package server continued to pull packages and updates from the new package server. Not until the new package server was shut down -- and the original package server brought up -- did the clients fail over to the original package server.

The package.xml file for each package snapshot contains a static list of package servers in the order that the package servers were added. The package servers are listed in the order created within the package.xml file. Therefore, the client will initially try to pull from the last known good package server and, if that server is not responding, the client will continue down the package.xml codebase reference to the next package server.

Statistics
0 Favorited
0 Views
0 Files
0 Shares
0 Downloads

Tags and Keywords

Comments

Jan 24, 2008 11:18 AM

I am asked about fault tolerance for package servers quite often. Thanks for sharing this with the community.

Dec 20, 2007 11:08 AM

We have about 105 package servers scattered all over the world just for our primary NS in the US.
So keeping them all up and running is a pretty tough task (since the majority are running on standard non-RAID, SFF desktop PCs).
There are some changes you can make to CoreSettings.config on the NS to adjust this time; I think it is called "PackageServerMaxDownloadTryingTime" or similar.
See https://kb.altiris.com/article.asp?article=1719&p=1

Dec 19, 2007 08:53 PM

This is great piece of information in setting package servers. Thank you.
When a package.xml is received on client side, client downloads package from nearest package server out of the list.
Thanks!
dgoyani

Related Entries and Links

No Related Resource entered.