SYMPTOMS
The File System Collector collects the same documents twice. The reason that files are re-collected is because the AccessTime is configured to be captured by the FileSystemCollector by default. Documents are considered as being changed and therefore re-collected.
CAUSE
Although the cause may differ per configuration/implementation it is likely an external system event that retrieves/adds/modifies files on the filesystem structure. Based on analysis of the file system collection behavior it could be that the filesystem file property accessed date is modified from time to time.
RESOLUTION
The suggestion to solve this is to disable the accessed timestamp in order to prevent re-collection on modified metadata.
Retrieve Accessed Timestamp (Default = True).
This option will add the zcxad property to the metadata of the collected document to the database (Document table in the eDiscovery database).
Set the value to False:
<setting key="Retrieve Accessed Timestamp" value="False" />
.\ZyLAB\EDiscovery\Ediscoveryservice\Assemblies\Performers
ZyLAB.EDiscovery.Collectors.FileSystem.dll.xml
<?xml version="1.0" encoding="utf-8" ?>
<zylab>
<configurations>
<configuration id="Default">
<settings>
<setting key="Retrieve Original Folder" value="True" />
<setting key="Retrieve Created Timestamp" value="True" />
<setting key="Retrieve Modified Timestamp" value="True" />
<setting key="Retrieve Accessed Timestamp" value="False" />
<setting key="Retrieve Size" value="True" />
<setting key="Retrieve Attributes" value="True" />
<setting key="NSRL Matching Enabled" value ="False" />
<setting key="Detect Split Archives Enabled" value ="True" />
<!--<setting key="AttemptsToRetryIoOperation" value ="3" />
<setting key="WaitingTimeBetweenAttemptsToRetryIoOperation" value ="20" />-->
</settings>
</configuration>
</configurations>
</zylab>
The collected documents will be processed according to the specified processing rules.
APPLIES TO
3.5; 3.6
Comments
0 comments
Article is closed for comments.