The parameters in this section customize the BinaryFileFilter task.
This task rejects any documents that have been filtered as binary. This can occur when the file associated with a document is corrupt and KeyView filtering fails. When CFS detects a non-UTF-8 character, it replaces the character with a hexadecimal character code. The BinaryFileFilter task detects these character codes and rejects any document in which the proportion of character codes exceeds the limit set by the ThresholdPercent parameter.
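The threshold check described above can be pictured with a short Python sketch. It is an illustration only, not the CFS implementation. Two things are assumptions: that the hexadecimal character codes look like `\xNN` escapes, and that the proportion is measured as the share of the document text taken up by those codes. The function names are hypothetical.

```python
import re

# Assumption: replaced characters appear as escapes such as "\xE9".
# The format that CFS actually writes may differ.
HEX_CODE = re.compile(r"\\x[0-9A-Fa-f]{2}")


def hex_code_percent(text: str) -> float:
    """Return the percentage of the text that consists of hex character codes."""
    if not text:
        return 0.0
    code_chars = sum(len(match.group(0)) for match in HEX_CODE.finditer(text))
    return 100.0 * code_chars / len(text)


def should_reject(text: str, threshold_percent: float) -> bool:
    """Reject the document when the proportion exceeds the threshold."""
    return hex_code_percent(text) > threshold_percent
```

With a threshold of 10, a document whose text is half hex codes would be rejected, while ordinary text containing no codes would pass.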
You must configure BinaryFileFilter as a Post task.
The task reads configuration parameters from the section of the configuration file that you specify in the Post parameter, for example:
[ImportTasks]
Post0=BinaryFileFilter:BinaryFileFilterSettings

[BinaryFileFilterSettings]
ThresholdPercent=10
OnErrorIndexerSections=IdolErrorServer
IndexDatabase=IdolErrorReview
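To make the link between the Post parameter and the settings section concrete, the following Python sketch reads the example configuration and follows the `TaskName:SectionName` value to the section that holds the task's parameters. It uses the standard `configparser` module purely for illustration; the helper name `task_settings` is hypothetical, and this is not how CFS itself parses its configuration.

```python
import configparser

SAMPLE = """\
[ImportTasks]
Post0=BinaryFileFilter:BinaryFileFilterSettings

[BinaryFileFilterSettings]
ThresholdPercent=10
OnErrorIndexerSections=IdolErrorServer
IndexDatabase=IdolErrorReview
"""


def task_settings(config_text: str, post_key: str = "Post0") -> dict:
    """Resolve a Post entry to its task name and the parameters in its section."""
    # Only "=" separates names from values, so the ":" stays inside the value.
    parser = configparser.ConfigParser(delimiters=("=",), interpolation=None)
    parser.optionxform = str  # keep parameter names exactly as written
    parser.read_string(config_text)
    task_name, _, section = parser["ImportTasks"][post_key].partition(":")
    return {"task": task_name, "section": section, "params": dict(parser[section])}
```

Calling `task_settings(SAMPLE)` returns the task name BinaryFileFilter, the section name BinaryFileFilterSettings, and the three parameters from that section as strings.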