Redact Documents
You can use the Eduction
task to redact information in documents.
To enable redaction, set the configuration parameter RedactedOutput=True
. If you want to specify the value or characters that replace the redacted text, use the configuration parameter RedactionOutputString
or RedactionReplacementCharacter
.
For example, the following configuration redacts addresses contained in a document's DRECONTENT
or ADDRESS
fields:
[ImportTasks] Post0=Eduction:EductionSettings [EductionSettings] ResourceFiles=C:\Autonomy\IDOLServer\Eduction\address_gb.ecr SearchFields=DRECONTENT,ADDRESS RedactedOutput=True
The fields specified by SearchFields
are not modified. CFS places the redacted text in fields with a _REDACTED
suffix. For example:
#DREFIELD ADDRESS="Cambridge Business Park, Cowley Road, Cambridge, CB4 0WZ" #DREFIELD ADDRESS_REDACTED="[redacted]"
The Eduction
task also adds the value, offset, and score for any matched entities to the document. For example:
#DREFIELD /offset="298" #DREFIELD /score="1" #DREFIELD /value="Cambridge Business Park, Cowley Road, Cambridge, CB4 0WZ"