Redact Documents

You can use the Eduction task to redact information in documents.

To enable redaction, set the configuration parameter RedactedOutput=True. If you want to specify the value or characters that replace the redacted text, use the configuration parameter RedactionOutputString or RedactionReplacementCharacter.

For example, the following configuration redacts addresses contained in a document's DRECONTENT or ADDRESS fields:

[ImportTasks]
Post0=Eduction:EductionSettings

[EductionSettings]
ResourceFiles=C:\Autonomy\IDOLServer\Eduction\address_gb.ecr
SearchFields=DRECONTENT,ADDRESS
RedactedOutput=True

The fields specified by SearchFields are not modified. CFS places the redacted text in fields with a _REDACTED suffix. For example:

#DREFIELD ADDRESS="Cambridge Business Park, Cowley Road, Cambridge, CB4 0WZ"
#DREFIELD ADDRESS_REDACTED="[redacted]"

The Eduction task also adds the value, offset, and score for any matched entities to the document. For example:

#DREFIELD /offset="298"
#DREFIELD /score="1"
#DREFIELD /value="Cambridge Business Park, Cowley Road, Cambridge, CB4 0WZ"