ConvertDocumentFile

Knowledge Discovery documents can have one or more associated files. In NiFi Ingest, FlowFiles that represent documents can contain:

  • a path to a file stored on a local or network file system (a contentfilename part)
  • embedded binary content (a contentfile part)
  • a reference to a file stored by an external storage provider such as Amazon S3, Azure Blob Storage, or Google Cloud Storage (an externalfile part)

This processor provides a way to convert between these types of FlowFile part. For example, the processor can take embedded binary content, upload the file to external storage, and replace the contentfile part with an externalfile part.

For more information about document FlowFiles, see Introduction to FlowFiles and Documents.

Properties

Name Default Value Description
Convert Source Type all-source-types

The type of associated file to convert from.

For example: When you select filename, FlowFile parts containing local file paths are converted, but the processor does not modify FlowFile parts containing embedded binary content.

Convert Target Type embedded-file

The type of associated file to convert to:

  • filename
  • embedded-file
  • external-file

    NOTE: Reading files from external storage can be much slower than reading local files. Using external storage could degrade performance, especially if your dataflow has a series of processors that all require access to the files.

Idol Flow File Service   When you convert associated files to external files you must specify an IdolFlowFileServiceImpl that manages the connection to the external file storage provider.
Delete file if have ownership True Whether to delete the original files, when converting from file names or external files to another type of associated file, and NiFi Ingest has ownership of the original file.

Relationships

Name Description
success FlowFiles that were successfully processed.
failure FlowFiles that were not successfully processed.