TableCellPath

The XML path to a cell in the table.

This parameter allows you to perform entity extraction on structured table data, such as the OCR output from Media Server. TableCellPath provides information about the structure, which Eduction uses to match against HeaderEntityN and CellEntityN.

Eduction assumes that cells are directly beneath rows, and that rows are directly beneath tables in the structure. You can also use ColumnSpanAttribute to specify a cell attribute that defines the column span widths in your data.
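To illustrate the structural assumption, the following minimal Python sketch reads a path such as OCR/TABLE/ROW/CELL as root/table/row/cell and walks a small XML document accordingly. It is not how Eduction itself processes the path. The sample XML, its contents, and the read_table function are hypothetical and exist only for this illustration; the real Media Server OCR output might contain additional elements and attributes.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample data, shaped to fit TableCellPath=OCR/TABLE/ROW/CELL.
SAMPLE = """<OCR>
  <TABLE>
    <ROW><CELL>Name</CELL><CELL>Date of birth</CELL></ROW>
    <ROW><CELL>Jane Doe</CELL><CELL>1980-01-01</CELL></ROW>
  </TABLE>
</OCR>"""


def read_table(xml_text, table_cell_path):
    """Return each table as a list of rows, and each row as a list of cell texts.

    Illustration only: the last three path components are treated as
    table, row, and cell, with cells directly beneath rows and rows
    directly beneath tables.
    """
    parts = table_cell_path.split("/")
    if len(parts) < 3:
        raise ValueError("expected at least table/row/cell in the path")

    root = ET.fromstring(xml_text)
    if root.tag != parts[0]:
        return []  # the document does not start at the expected element

    row_tag, cell_tag = parts[-2], parts[-1]
    # Path from the root element down to the table element, if any.
    table_path = "/".join(parts[1:-2])
    tables = root.findall(table_path) if table_path else [root]

    result = []
    for table in tables:
        rows = []
        for row in table.findall(row_tag):  # direct children only
            rows.append([(cell.text or "").strip() for cell in row.findall(cell_tag)])
        result.append(rows)
    return result


tables = read_table(SAMPLE, "OCR/TABLE/ROW/CELL")
```

In this sketch, a ROW element nested anywhere other than directly beneath a TABLE element is ignored, which mirrors the assumption described above.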

The Media Server installation includes an example configuration file and XSLT file (NiFiMediaAnalysisOCR.cfg and NiFiMediaAnalysisOCR.xsl), which you can use with IDOL NiFi Ingest to create the structured data for Eduction.

For more information about table extraction, refer to the Eduction User and Programming Guide.

Type: String
Default: None
Required: No

Configuration Section: Any section that you have defined for Eduction settings
Example:
TableCellPath=OCR/TABLE/ROW/CELL
See Also:
CellEntityN
EntityFieldN
HeaderEntityN
ColumnSpanAttribute