Video Screencast Help
Search Video Help Close Back
to help
Not able to make it to Vision this year? Get a sampling in the Best of Vision on Demand group.

DLP File Name Filter

Created: 29 Nov 2011 | 4 comments
Lind311's picture
0 0 Votes
Login to vote

Hi

I'm trying to create a IDM profile for one of our shares but have run into some issues.

1. It's saying that the share is too big to fully index and stops after a while. The share is not "big" in my opinion. It's about 8GB and about 20000 files in total. Looting through documentation it stated that you should be able to index between 300k-400k files. So what is the actual limit? Is it size, amount or something else?

2. So if above is true and there is nothing I can do about it then so be it... Instead of stomping around I thought that I might try to exclude some folders to reduce the share size. This, according to the manual, is done by File Name Filtering. So what I tried was to filter out some subfolders containing archives of documents.
The problem is that its sub-sub folders. Ex. I've pointed out the share and below this are about 130 folders that each and everyone contains a subfolder named archive. I can't seem to get the filter to understand that these should be excluded. The filter I tried was from the example in the manual: */Archive/*, *.tmp, *.log, *.bak
Tried changing from / to \ and also *\*\Archive\* but nothing works. What am I doing wrong?

Best regard

Oscar

Comments

Alexander Suyazov's picture
05
Dec
2011
0 Votes 0
Login to vote

I can answer about idm size.

I can answer about idm size. Limit is .rdx index file size. Max size is about 120 (128?) mb.
Can you show the error ?
Try to copy this share to local drive on enforce server and try again. (I had this kind of situation. If local indexing is complete successfully - it's the same problem)

fcruchaga's picture
07
Dec
2011
0 Votes 0
Login to vote

I may be able to help

First, are the files you are trying to IDM very Large and when I say that are they over 30MB in size (which by default is the largest file size the system will IDM). Also, I assume you are doing the IDM on files that are not locaed on the Enforce server correct? I would suggest testing  with a small Folder first and then keep adding to it and see at what point it fails. If you look at the TomCat logs it might show whats going on.

Lind311's picture
12
Dec
2011
0 Votes 0
Login to vote

Alexander: Here is the

Alexander:
Here is the message i recieve in the logfile:

 

12 dec 2011 09:36:18,558- Thread: 24 WARNING [com.vontu.profiles.manager.InfoSourceIndexCreator] Document profile. CAM test has reached maximum size. Only 6901 out of 12879 documents are indexed.
 
I have tried to narrow down the index with filtering but i have reached the limit for that now. Cant exclude any more without not indexing very essential files...
 
fcruchaga:
The files vary in size and i know the limit. The problem is that the files is ~15MB each so with the amount of ~12000 files you see the problem that some may go over 30mb i size. i can of couse exclude any file exceeding this but then i loose a lot of essential indexing.
The IDM is createx over SMB since its on a netapp.
I have tried to index a smaller folder and that works but thats not the problem.
 
 
 
Some documents i have from Symantec clearly states that if its a large share being indexed the server will create several .rdx files. By large i do not hope they mean 1,3GB data...
Still it doesn't create several .rdx files either.  As the storage grows i all the world this scenario seems more and more impossible as the product is designed today. Is there actually anybody that has multi user shares that is below 1,3GB data today? And even if we would create several smaller indexes for each share it still doesn't help because the share is constantly changing. The administration for this many indexes would be too time consuming.
 
Am i the only one that have this issue? Even the support doesn't see my problem as an actual problem... All of a sudden i feel very lonely... ;)
 
Lind311's picture
13
Dec
2011
0 Votes 0
Login to vote

Side note:One feature i'm

Side note:

One feature i'm missing that would be great for this scenario is date filtering.

I have noticed that many files are quite old and have newer versions of themselves. Being able to sort them out would help me a lot in this case... But that is just a wish of course... ;)