MetadataSelectorExtractPlainText

A Boolean value that specifies whether to extract metadata as plain text.

To include only plain text in metadata fields created with MetadataSelector and MetadataFieldName, set this parameter to TRUE.

Type: Boolean
Default: False
Required: No
Configuration Section:

Any section that you have defined for WkoopHtmlExtraction settings

Example:

The following might appear in an HTML document:

<p class="important">This is <strong>important</strong> text</p>

To extract the information and add it to a metadata field you might use the following configuration:

MetadataSelectorExtractPlainText=TRUE
MetadataSelector0=p.important
MetadataFieldName0=important_paragraph

When MetadataSelectorExtractPlainText=True, the information is extracted as plain text. In this example the <strong> HTML tags are removed:

#DREFIELD important_paragraph="This is important text"
See Also:

MetadataFieldName

MetadataSelector