Information and data evidence to collect after a Domino server crash
|Article:TECH83986|||||Created: 2005-01-12|||||Updated: 2011-12-21|||||Article URL http://www.symantec.com/docs/TECH83986|
A Domino server where Symantec Mail Security for Domino (SMSDOM) has crashed and the Notes System Diagnostics (NSD) show the faulting process name: nntask.
NNTASK is SMSDOM's main process name.
Before proceeding, it is very important to locate a file called smspanic.txt in the system.
The file may contain useful information to research in the Symantec Technical Knowledge Base.
Gathering information and data evidence for Symantec Technical Support
When Technical Support is contacted for a crash issue, please have the following information and data evidence ready:
Required: Description of crash
- Which process is FATAL? What exception code did the fatal process report?
- What event, if any, occurs at the same time as the crash?
- Does the crash occur more than once? Regularly? At a specific time?
- If at seemingly random intervals, what is the longest and shortest time between crashes?
- Does load seem to be related?
- Does LiveUpdate seem to be related?
- If LiveUpdate is related to the crash, does the crash occur if LiveUpdate is run manually, or when scheduled?
- Does LiveUpdate retrieve definitions from the public Symantec LiveUpdate servers or a private server internal to the network?
- Does SMSDOM get definitions in any other way such as replication or via SAVCE or SEP?
- How many Domino servers have experienced the crash?
- How many Domino servers are in the environment?
- Are any non-Domino servers impacted in any way? If so, how?
- What was the first symptom that led you to suspect the Domino server crashed?
- Were there any other symptoms related to the crash?
- Can the crash be reproduced on demand? If so, what are the steps to reproduce? Is there a sample file?
Required: Domino Configuration
- What is the exact version of SMSDOM or SAVfDOM?
- What is the exact version, type, and bitness of Domino?
(Examples: Domino 8.5FP1 Messaging Server [64-bit], Domino 7.0.4 [32-bit])
- What is the exact version of operating system, including type, bitness, and Service Pack, if any?
(Examples: Windows2003Enterprise SP2[64-bit], RHEL4AS Update 8 [32-bit])
- Partitions: Is there more than one DPAR or LPAR?
- Domino Clusters: Describe each type of Domino cluster where the Domino server is either a hub server or spoke server.
- Are any of the Symantec databases replicated?
- Other Clusters: Describe every other type of clustering where the Domino server is either an active or passive node.
- Realtime Filesystem Utilities: Enumerate each realtime filesystem utility, including name, manufacturer, and exact software version.
These might be local filesystem antivirus, automatic background disk defragmenters, or realtime mirroring utilities.
(Examples: Symantec Antivirus Corporate Edition 10.2.1, Diskeeper)
- Exclusions: Is SMSDOM excluded from the realtime filesystem utilities?
- Backups: which product backs up Domino server data? Do backups occur within Domino, at a filesystem level, or both?
- When are backups scheduled?
- What third party software runs as an extension within the Domino extension manager or as a task within Domino?
- What recent changes were implemented in the environment before the crash or first crash?
Required: Data evidence for Symantec Technical Support
The following data should be collected from each affected Domino server.
- The Domino main configuration file: notes.ini
- "smspanic.txt" file containing text. The default locations for this file are as follows:
- Windows: C:\Program Files\Symantec\SMSDOM\smspanic.txt
- *nix/Solaris: <Domino Data Directory/SAV/savlog/smspanic.txt
- If the smspanic.txt file is not present, or contains no text, please attempt to reproduce the crash, or wait for the crash to reproduce, with the following forms of tracing enabled:
- SMSDOM trace logging (Please see the article TECH80022 in the "Related Articles" section below)
- Semaphore debugging within Domino (Please see this IBM article for more info)
- Notes.ini set to produce nsd's containing memcheck data
- Notes.ini should contain the following lines:
- The following NSF files:
- Savdefs.nsf (if it exists)
NOTE: all NSF files must not have any local access protection set.
- Compressed archive of the entire folder: <Domino installation path>
- Output of running processes/tasks on the Domino server
- Windows: screen shot(s) of Processes tab within Task Manager with "Show processes from all users" checked, sorted by Image Name
- *nix/Solaris: ps -ef > processes.txt
- LiveUpdate configuration
- Windows: \Documents and Settings\All Users\Application Data\Symantec\LiveUpdate\Settings.LiveUpdate
- *nix/Solaris: /etc/liveupdate.conf
- LiveUpdate logs
- Windows: \Documents and Settings\All Users\Application Data\Symantec\LiveUpdate\Log.LiveUpdate
- *nix/Solaris: /tmp/livupdt.log
- Operating System Logs
- Windows: Application Log and System Log, and Dr.Watson Log, if one exists
- *nix/Solaris /var/log/messages
- Operating system tuning and configuration data
- Windows: a compressed archive containing exports of the following registry keys, should they exist:
- HKLM\System\CurrentControlSet\Control\Session Manager
- Any other data which relates to the crash, such as a Mail.box containing messages which caused the crash
OS-Specific logs required
iSeries - OS/400
Please get the job log of the dead NNTASKprocess, using the following commands:
DSPSPLF FILE(QPJOBLOG) JOB(job_number/Usrid/Job_name)
DSPJOB JOB(job_number/Usrid/Job_name) OPTION(*SPLF)
for more information please refer to: http://publib.boulder.ibm.com/infocenter/iadthelp/v6r0/index.jsp?topic=/com.ibm.etools.iseries.rse.doc/tasks/tdspjoblog.htm
When you contact Symantec Technical Support, provide as much specific information as possible.
Your support agent can work with you to transfer your log information to them.
The following IBM documents can help understand NSD files:
- Title: NSD and Memcheck for Windows - Selected Versions - Document 4003878
- Title: Overview of NSD Analysis on UNIX and S/390 - Document 7003396
- Title: Memcheck: What Is It and How Does it Work? - Document 1086330
- Title: Using NSD - A Practical Guide
Utilities to crash client and server
Article URL http://www.symantec.com/docs/TECH83986