XML Output Format

This section describes the XML output tags and attributes that edktool returns when you run the extract command with a plain text file.

<MATCH>

The details of a match.

Attribute Description
EntityName The name of the matched entity.
Offset The match start position in the buffer, in bytes.
OffsetLength

The match start position in the buffer, in characters.

Score The final score for this match.
NormalizedTextSize

The length of the normalized text, in bytes.

NormalizedTextLength

The length of the normalized text, in characters.

OriginalTextSize

The length of the matched text, in bytes.

OriginalTextLength

The length of the matched text, in characters.

The following table lists the child elements that <MATCH> contains.

Child Elements Description
<ORIGINAL_TEXT>

The text from the input that was matched.

<NORMALIZED_TEXT>

The normalized match text.

Normalized text might differ from the original text if, for example, a synonym was matched (in which case the original text is replaced with the headword), or a pattern used the replace attribute or the (?A!...) syntax.

The text might also be adjusted by post-processing scripts, if configured.

<COMPONENTS> The match components, if present for this entity and when EnableComponents=TRUE.

<COMPONENT>

The details of a component.

Attribute Description
Name The component name.
Text The component text (normalized, if applicable).
Offset The component start position in the matched text, in bytes.
OffsetLength The component start position in the matched text, in characters.
TextSize The component text length, in bytes.
TextLength The component text length, in characters.