CellEntityN

The entities to extract for the cell rows of input tables. This parameter allows you to extract entities from structured data.

When matching CSV or TSV input, Eduction matches the first one or more non-empty row of input against the configured header entities (see HeaderEntityN and MaxSearchHeaderRow). In subsequent rows, Eduction matches individual cells against the cell entity corresponding to the matched header entity, if any.

For example:

HeaderEntity0=pii/date/dob/landmark/all
CellEntity0=pii/date/nocontext/all

This example matches date of birth landmark values in the header, and for all subsequent rows in that column, it extracts any date values.

You can specify multiple entities in a comma-separated list. If the table header matches any of the configured header entities, Eduction matches the cell content against any of the configured cell entities. This option might be useful if you want to match a particular entity in multiple languages, or if you want to include a custom entity.

You can also use wildcard expressions in the entity names. The * wildcard matches any number of characters, and the ? wildcard matches a single character.

For more information about table extraction, refer to the Eduction User and Programming Guide.

Type: String
Default: None
Required:

No

Configuration Section: Eduction
Example:
HeaderEntity0=pii/date/dob/landmark/all
CellEntity0=pii/date/nocontext/all
See Also:

HeaderEntityN

CellEntityMatchLimitN