FilenameFromMetadata

The FilenameFromMetdata processor extracts file names or URLs from document metadata fields. If the extracted value is a URL, the processor also downloads the file. The file name or downloaded file is associated with the FlowFile so that it can be processed by other processors.

Properties

Name Default Value Description
XPath   An XPath expression that identifies the XML metadata field(s) to extract file names or URLs from.
Filename from field regex ^(.*)$ A regular expression that includes a single capture group that extracts the file name or URL from the field value. The default extracts the entire field value.
Proxy Host   The host name or IP address of the proxy server to use to download URLs.
Proxy Port   The port of the proxy server to use to download URLs.
Filename Is URL False A Boolean value that specifies whether the text matched by the regular expression capture group is a URL to a file that must be downloaded.
Temp Directory temp The path to a temporary directory in which to store downloaded files.

Relationships

Name Description
success Successfully processed FlowFiles are routed to this relationship.
failure FlowFiles that had an invalid or unknown format.