Grammars

A grammar is a file that defines rules for matching complex entities, such as URLs or postal addresses. Entities can be defined recursively, and rules can refer to entities in external grammars and dictionaries. Eduction uses the grammar to scan a document and extract the defined entities that match the search pattern.

The pattern can be a dictionary of names, such as people or places. Alternatively, the pattern can describe what the sequence of text looks like without listing every possible value explicitly, for example, a telephone number or a time.
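The difference between the two kinds of pattern can be illustrated with a minimal Python sketch. This is not Eduction grammar syntax and does not use any Eduction API; the names PLACE_NAMES, TIME_PATTERN, extract_places, and extract_times are hypothetical and exist only for illustration. The first entity lists its values explicitly, and the second describes the shape of the text.

```python
import re

# Hypothetical dictionary-style entity: every value is listed explicitly.
PLACE_NAMES = ["London", "New York", "Paris"]

# Hypothetical pattern-style entity: a 24-hour time such as 09:30 or 23:59,
# described by its shape rather than by listing all 1440 possible values.
TIME_PATTERN = re.compile(r"\b(?:[01]\d|2[0-3]):[0-5]\d\b")


def extract_places(text):
    """Return every listed place name found in text, in order of appearance."""
    # Try longer names first so a long name is preferred over a shorter prefix.
    names = sorted(PLACE_NAMES, key=len, reverse=True)
    alternatives = "|".join(re.escape(name) for name in names)
    return re.findall(r"\b(?:" + alternatives + r")\b", text)


def extract_times(text):
    """Return every substring of text that looks like a 24-hour time."""
    return TIME_PATTERN.findall(text)
```

For example, calling both functions on the sentence "The train from Paris reaches London at 18:45." finds the two place names through the list and the time through the pattern.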

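Grammars are written in XML and use a regular expression format for patterns. Eduction supports context-free grammars, which means that an entity definition can refer to other entities, including itself.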
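To show why recursive entity definitions are useful, the following Python sketch matches a nested structure that a single regular expression cannot describe in general. It is an illustration of the idea only, not Eduction syntax or behavior. The rules "item" and "group", and all function names, are assumptions made for this example: an item is a number or a group, and a group is a parenthesized, comma-separated list of items.

```python
import re

NUMBER = re.compile(r"\d+")


def match_item(text, pos):
    """item := number | group. Return the end index of the match, or None."""
    found = NUMBER.match(text, pos)
    if found:
        return found.end()
    return match_group(text, pos)


def match_group(text, pos):
    """group := '(' item (',' item)* ')'. Return the end index, or None."""
    if pos >= len(text) or text[pos] != "(":
        return None
    end = match_item(text, pos + 1)
    if end is None:
        return None
    # Consume any further comma-separated items.
    while end < len(text) and text[end] == ",":
        end = match_item(text, end + 1)
        if end is None:
            return None
    if end < len(text) and text[end] == ")":
        return end + 1
    return None


def extract_groups(text):
    """Scan text left to right and return every outermost group found."""
    results = []
    pos = 0
    while pos < len(text):
        end = match_group(text, pos)
        if end is None:
            pos += 1
        else:
            results.append(text[pos:end])
            pos = end
    return results
```

Because match_group calls match_item, which can call match_group again, the matcher accepts groups nested to any depth, and an unclosed group is rejected rather than partially extracted.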

Eduction also allows you to extend existing grammars, and to author new ones, either from scratch or by referencing existing entities. For more information about how and when to extend your grammars, and how to improve the recall of your grammar files, refer to IDOL Expert.

