RemoveDocumentPart

Removes parts of a document.

The primary reason to use this processor is to remove contentfilename or externalfile document parts, which allows NiFi Ingest to delete temporary files. OpenText recommends you do this at the end of your dataflow, after extraction, filtering, media analysis, and so on.

Properties

Name Default Value Description
Remove document content parts false Specifies whether to remove document content.
Remove document contentfile parts true Specifies whether to remove binary file content.
Remove document contentfilename parts true

Specifies whether to remove references to files (local or network file system paths).

A contentfilename part of a document contains a path to a file on disk. If the file is a temporary file that is owned by NiFi, then deleting this part of the document allows NiFi to delete the temporary file.

Remove document externalfile parts true

Specifies whether to remove references to files (in external storage such as Amazon S3, Azure Blob Storage, or Google Cloud Storage).

An externalfile part of a document contains a reference to a file in external storage. If the file is a temporary file that is owned by NiFi, then deleting this part of the document allows NiFi to delete the temporary file.

Remove document xmlmetadata parts false Specifies whether to remove document metadata.

Relationships

Name Description
success FlowFiles that were processed successfully.
failure FlowFiles that had an invalid or unknown format.