Enterprise Vault for Exchange & Domino EVDuplicateCleaner for 9.0.x and 10.0.x

Article:TECH193878  |  Created: 2012-07-27  |  Updated: 2014-05-05  |  Article URL http://www.symantec.com/docs/TECH193878
Article Type
Technical Solution

Product(s)

Issue



Under some circumstances EV mailbox archiving task failures could result in creating duplicate items in EV mailbox archives.


Solution



The EVDuplicateCleaner utility can be used to identify all the duplicate Savesets and then delete them leaving only last archived instance from each set of duplicate Savesets.

NOTE: EVDuplicateCleaner tool is now a standard component of Enterprise Vault 11.0; this tool can be found in the Enterprise Vault installation directory for Enterprise Vault 11.0 installations. In addition, the Symantec Enterprise Vault 11 Utilities guide contains more documentation about this tool. Refer to your Enterprise Vault 11 documentation for more details.

Installation Instructions
=========================
1. Stop the Enterprise Vault Services. Possibly also reset IIS (iisreset).

2. Copy the following files to the Enterprise Vault Install directory.
EVDuplicateCleaner.exe
EVDuplicateCleaner.exe.config

3. Edit the EVDuplicateCleaner.exe.config setting the configurations values as detailed in following section.

4. Restart Enterprise Vault Services.

5. The utility can then be run from the command prompt (Run As Administrator)

 

Execution/running Instructions
==============================
IMPORTANT: * THIS UTILITY MUST BE RUN UNDER VSA ACCOUNT
** BEFORE RUNNING THE TOOL, TURN OFF THE JOURNAL TASK

Following are the various settings (in EVDuplicateCleaner.exe.config) that should be correctly configured before running this utility.
1. DirDBSQLServer - Mandatory, the name of the SQL Server hosting Directory database.
2. MaxAllowedFailures - Maximum number of errors after which it will stop processing (default = unlimited) .
3. SavesetChunkSize - Number of duplicate savesets to fetch in one call to directory database (default = 10000).
4. ErrorToLogFile - Do we need to log errors to output log file (default = yes).
5. MaxDuplicateSavesetsToReport - Maximum number of duplicated savesets that will be processed in 'Report Mode' (default = 100000).
6. TryToOverrideDumspter- If dumpster is enabled then this decides if we want items to be hard deleted (i.e. not going to dumpster but deleted directly).
This override will work only from V75 onwards. (default = true)
7. SkipLegacySavesets - If there are not legacy savesets (pre EV 8.0), then set this to True to skip even bothering looking for them.

Following are the supported commands for running this utility.

EVDuplicateCleaner.exe Execute[E,Ex,Exe,...,Execut] <VaultStoreName> [<ArchiveEntryId>]
EVDuplicateCleaner.exe Report[R,Re,Rep,...,Repor] <VaultStoreName> [<ArchiveEntryId>]
EVDuplicateCleaner.exe Summary[S,Su,Sum,...,Summar] <VaultStoreName> [<ArchiveEntryId>]

Details of various commands
1. Execute - In this mode EVDuplicateCleaner.exe will delete all duplicate savesets. VaultStoreName is mandatory, ArchiveEntryId may also be provided in order to process one single Archive.
2. Report - The output log will contain list of duplicate savesets (by default maximum 100000) and a count of how many duplicate items each one has.
3. Summary - Will just output summary of duplicates per archive based on the supplied inputs.

 

1. This utility uses the following attributes of a saveset to identify duplicates

EV 8.0 savesets
--------------
ArchivePointIdentity,VaultIdentity (folder identity) ,IdDateTime (creation date),IdUniqueNo, SisPartCount.
In addition to the above, each item's content fingerprint is checked. Items with different content fingerprint will not be treated as duplicate.

2. Maximum number of duplicate savesets processed in 'report mode' will be limited to value configured through 'MaxDuplicateSavesetsToReport' setting (default = 100000).

3. This utility will not delete duplicate items that are on legal hold or on other compliance devices of if for similar reasons (retention category) delete is prohibited.

4. Running the utility and removing duplicates may have the effect of temporarily "breaking" shortcuts in Exchange user mailboxes as the Saveset id that is contained in the shortcut may be incorrect if it points to a now removed duplicate.
 This can be fixed by using the modified Shortcut Processing - see below.

 NOTE: This behavior is not expected to occur with Domino archives as the shortcut should reflect the latest archived copy of the item which is maintained by the tool, but if any examples do occur, contact Support for further investigation.

Enhancements to Shortcut Processing
 ===================================
 In certain conditions the duplicate removal tool may have the effect of breaking shortcuts.
 After running EVDuplicateCleaner users may complain that they can no longer retrieve items from their mailbox shortcuts.
 This does not affect archive explorer or the web search app, and shortcuts will be expired according to site policy anyway, but if users really want their shortcuts repaired then the following regkey can be used to activate the new "shortcut fix up" functionality:
 
HKLM\SOFTWARE\KVS\Enterprise Vault\Agents\FixOrphanedShortcut  - DWORD; set to 1 to enable new function.
 
*******************************************************************************************************************************************
 * BE SURE TO REMOVE THE REGKEY ONCE THE MAILBOXES HAVE BEEN FIXED UP - otherwise mailbox processing performance may be adversely affected *
 *******************************************************************************************************************************************
 
Also: if the mailboxes contain duplicates of meeting requests, be sure to add "IPM.Schedule*" to the message classes supported by the site and then check the same message class in the policy for the mailbox in question.
 
This regkey has the effect of FORCING more intensive checks for orphaned shortcuts.
 If a shortcut is deemed to be "orphaned" before it is finally removed a check is made to see if a duplicate Saveset could exist that this shortcut can be fixed to point to. If that is the case then the shortcut will be fixed up to point to the duplicate of the original Saveset. If there is any doubt about the duplicate Saveset then the shortcut will not be fixed. If not duplicate can be found then the shortcut will be deleted as orphaned.
 

Report Mode:
------------
To see the effect of the new shortcut processing BEFORE actually performing shortcut processing run the mailbox archiving task in "Report" mode against any mailboxes you wish to fix up.
A report will be generated per-mailbox in the Reports folder beneath the EV install folder.
It details either what would happen or what happened during shortcut processing.

Notes:

1. This tool is version specific - Make sure to install the correct version only

 


Attachments

EVDuplicateCleaner for EV 9.0.1 - For Exchange Archives only.
9_0_1EVDuplicateCleaner.zip (25 kBytes)
EVDuplicateCleaner for EV 9.0.2- For Exchange Archives only.
9_0_2EVDuplicateCleaner.zip (25 kBytes)


EVDuplicateCleaner for EV 9.0.3
9_0_3EVDuplicateCleaner.zip (25 kBytes)
EVDuplicateCleaner for EV 9.0.4
9_0_4EVDuplicateCleaner.zip (25 kBytes)
EVDuplicateCleaner for EV 10.0.1
10_0_1EVDuplicateCleaner.zip (25 kBytes)
EVDuplicateCleaner for EV 10.0.2
10_0_2EVDuplicateCleaner.zip (25 kBytes)
EVDuplicateCleaner for EV 10.0.3
10_0_3EVDuplicateCleaner.zip (26 kBytes)
EVDuplicateCleaner for EV 10.0.4
10_0_4EVDuplicateCleaner.zip (25 kBytes)


Article URL http://www.symantec.com/docs/TECH193878


Terms of use for this information are found in Legal Notices