MetadataSelector

A CSS selector that identifies an element in the HTML to extract metadata from. The content of each matching element is extracted and added to the document metadata. If you combine this parameter with MetadataAttribute then the value of the specified attribute is extracted and added to the document metadata instead.

If you are extracting metadata for a child document, specify the selector relative to the element identified by ChildDocumentSelector.

To specify the name of the document field(s) to contain the extracted information, set the configuration parameter MetadataFieldName.

Type: String
Default:  
Required: No
Configuration Section: Any section specified by MetadataFieldSections, ChildDocumentMetadataFieldSections, or ChildMetadataFieldSections
Example:
[MyTask]
MetadataFieldSections=ExtractH1

[ExtractH1]
MetadataSelector=h1
MetadataFieldName=heading

With this example, the connector might extract the following from the HTML document:

<h1>This is a title</h1>

...and add the information to the following IDOL document fields:

#DREFIELD heading="This is a title"
See Also:

MetadataFieldName

MetadataAttribute