TableCellPath
The XML path to a cell in the table.
This parameter allows you to perform entity extraction on structured table data, such as the OCR output from Media Server. TableCellPath provides information about the structure, which Eduction uses to match against HeaderEntityN and CellEntityN.
Eduction assumes that the cells are directly beneath rows, and that rows are directly beneath tables in the structure. You can optionally also use ColumnSpanAttribute to specify a cell attribute that defines the column span widths in your data.
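The structural assumption above (cells directly beneath rows, rows directly beneath tables) can be illustrated with a short Python sketch. It shows only how a path such as OCR/TABLE/ROW/CELL maps onto an XML document; it is not how Eduction is implemented. The sample XML, the read_tables helper, and the colspan attribute name are all hypothetical and chosen for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample data. Element names follow the example path
# OCR/TABLE/ROW/CELL; the "colspan" attribute name is an assumption.
SAMPLE = """
<OCR>
  <TABLE>
    <ROW><CELL colspan="2">Invoice</CELL></ROW>
    <ROW><CELL>Item</CELL><CELL>Price</CELL></ROW>
    <ROW><CELL>Widget</CELL><CELL>9.99</CELL></ROW>
  </TABLE>
</OCR>
"""


def read_tables(xml_text, cell_path, span_attr=None):
    """Return tables as lists of rows, each row a list of (text, span) pairs.

    The last three steps of cell_path are read as table/row/cell, so cells
    must sit directly beneath rows and rows directly beneath tables.
    """
    parts = cell_path.strip("/").split("/")
    if len(parts) < 3:
        raise ValueError("path needs at least table/row/cell")
    *prefix, table_tag, row_tag, cell_tag = parts

    root = ET.fromstring(xml_text)
    if prefix:
        # The first step names the root element; the rest locate the tables.
        if root.tag != prefix[0]:
            return []
        table_elements = root.findall("/".join(prefix[1:] + [table_tag]))
    else:
        table_elements = [root] if root.tag == table_tag else []

    tables = []
    for table in table_elements:
        rows = []
        for row in table.findall(row_tag):  # direct children only
            cells = []
            for cell in row.findall(cell_tag):  # direct children only
                span = int(cell.get(span_attr, "1")) if span_attr else 1
                cells.append(((cell.text or "").strip(), span))
            rows.append(cells)
        tables.append(rows)
    return tables


tables = read_tables(SAMPLE, "OCR/TABLE/ROW/CELL", span_attr="colspan")
for row in tables[0]:
    print(row)
```

In this sketch a cell nested any deeper than ROW/CELL is ignored, which mirrors the assumption described above. The span value is what a column-span attribute would contribute when aligning a wide header cell with the columns beneath it.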
The Media Server installation includes an example configuration file and an XSLT file (NiFiMediaAnalysisOCR.cfg and NiFiMediaAnalysisOCR.xsl), which you can use with IDOL NiFi Ingest to create the structured data to use for Eduction.
For more information about table extraction, refer to the Eduction User and Programming Guide.
Type: | String |
Default: | None |
Required: | No |
Configuration Section: |
|
Example: | TableCellPath=OCR/TABLE/ROW/CELL |
See Also: |