Languages
The languages that you expect to appear in the text. Set this parameter to a comma-separated list of ISO 639-1 language codes. This instructs Media Server to expect characters from the appropriate character sets, and specifies the dictionaries to use.
TIP: The parameter FontType also restricts the languages that are used. For example, the OCR-A and OCR-B font types only support languages that use the Latin alphabet. The Languages
parameter is ignored if you set FontType=e13b
or FontType=fa7b
.
To enable all languages supported by your chosen FontType, you can set this parameter to ALL
. OpenText recommends this value only when you do not know which languages to expect. Specify one language or a small number of languages to obtain the best speed and accuracy.
To detect only ASCII characters, with no language-specific dictionary, set this parameter to ASCII
. This option is useful if you want to match text against a known list of words or codes, that use only ASCII characters and are defined in a custom dictionary.
You can add additional characters to the expected character set(s) with the parameter ExtraEnabledCharacters, remove characters from the character set(s) with the parameter DisabledCharacters, and configure additional dictionaries by setting the parameter UserDictionary.
For a list of supported languages and their language codes, see OCR Supported Languages.
Type: | String |
Default: | en |
Required: | No |
Configuration Section: | TaskName |
Example: | Languages=en,fr,ja
|
See Also: |