StemmingFile
The name of a file to use to control stemming of the words that it contains. The specified StemmingFile overrides the standard IDOL stemming algorithms for the language type MyLanguage. For words that are not in the stemming file, IDOL Server stems values according to the value of the Stemming parameter for the language type.
IDOL Server applies the stemming file rules before it performs transliteration (if it is enabled).
The stemming file must be in the LanguageDirectory. You must format it like a stop list file, except that each line contains two terms: the original term followed by its stem. For example:
[ASCII]
MICE MOUSE
MOUSE MOUSE
CHILDREN CHILD
...
The stemming terms can contain only alphanumeric values.
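The format rules above (two terms per line, alphanumeric terms only) can be checked before you deploy a file. The following is a minimal sketch, not part of IDOL Server. It assumes that terms are separated by whitespace and that a bracketed first line such as [ASCII] is a header to skip; the function name validate_stemming_lines is hypothetical.

```python
def validate_stemming_lines(lines):
    """Return a list of (line_number, message) problems found in the lines."""
    problems = []
    for number, raw in enumerate(lines, start=1):
        line = raw.strip()
        # Skip blank lines and a bracketed header such as [ASCII]
        # (assumption: the header is not a stemming rule).
        if not line or (line.startswith("[") and line.endswith("]")):
            continue
        terms = line.split()
        if len(terms) != 2:
            problems.append((number, f"expected 2 terms, found {len(terms)}"))
            continue
        for term in terms:
            # str.isalnum() accepts letters and digits only.
            if not term.isalnum():
                problems.append((number, f"non-alphanumeric term: {term!r}"))
    return problems


sample = ["[ASCII]", "MICE MOUSE", "MOUSE MOUSE", "CHILDREN CHILD"]
print(validate_stemming_lines(sample))  # prints []
```

An empty list means that no formatting problems were found under these assumptions. It does not confirm that IDOL Server accepts the file.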
To ensure that two words stem to the same value, you must add both words to the stemming file, with the appropriate stem. For example:
MICE MOUSE
MOUSE MOUSE
In this example, MICE and MOUSE both stem to MOUSE. If you specify only MICE MOUSE in the stemming file, MOUSE stems to MOUS, according to IDOL stemming rules.
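To find entries that might cause this mismatch, you can list every stem that does not also appear as an original term mapped to itself. The following is a minimal sketch under the assumption that the file has already been parsed into (original, stem) pairs; the function name stems_without_own_entry is hypothetical and is not an IDOL utility.

```python
def stems_without_own_entry(pairs):
    """Return stems that are not also listed as 'STEM STEM' in the pairs."""
    mapping = dict(pairs)
    flagged = []
    for _, stem in pairs:
        # A stem is safe only when the file maps it to itself.
        if mapping.get(stem) != stem and stem not in flagged:
            flagged.append(stem)
    return flagged


print(stems_without_own_entry([("MICE", "MOUSE")]))                      # prints ['MOUSE']
print(stems_without_own_entry([("MICE", "MOUSE"), ("MOUSE", "MOUSE")]))  # prints []
```

A flagged stem is only a candidate for review. It needs its own entry only if the IDOL stemming algorithm for the language would stem it to a different value, as with MOUSE and MOUS.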
NOTE: Use a stemming file only for unusual terms in special situations. Micro Focus does not recommend that you use a stemming file as a replacement for the IDOL stemming algorithms.
If you change the stemming file after you have indexed content into IDOL Server, you must reindex the content.
If you specify a full path to the Stoplist, you must also use a full path in the DecompositionFile and StemmingFile parameters. If you do not specify a path to the Stoplist, you must use only the file name in the DecompositionFile and StemmingFile parameters (that is, you must place these files in the LanguageDirectory).
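The path rule can be expressed as a simple consistency check on the configured values. The following is a minimal sketch, assuming that the values have already been read from the configuration file; the function names and the file names in the usage lines are hypothetical.

```python
import ntpath
import posixpath


def is_full_path(value):
    """True for an absolute POSIX path or an absolute Windows path."""
    return posixpath.isabs(value) or ntpath.isabs(value)


def is_bare_name(value):
    """True when the value contains no directory separators."""
    return "/" not in value and "\\" not in value


def paths_are_consistent(stoplist, other_files):
    """Check DecompositionFile and StemmingFile values against the Stoplist.

    other_files holds whichever of the two parameters are set.
    """
    if is_full_path(stoplist):
        # Full path to the Stoplist: the other files need full paths too.
        return all(is_full_path(value) for value in other_files)
    # No path on the Stoplist: the other files must be file names only.
    return all(is_bare_name(value) for value in other_files)


print(paths_are_consistent("english.dat", ["english_stems.txt"]))                    # prints True
print(paths_are_consistent("/opt/idol/langfiles/english.dat", ["english_stems.txt"]))  # prints False
```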
If you use a custom stemming library, stems in the stemming file take precedence. See CustomStemming.
Type: | String |
Default: | |
Required: | No |
Configuration Section: | LanguageTypes or MyLanguage |
Example: | [English]
|
See Also: | Stemming, Transliteration |
NOTE: If you change this setting after you have indexed content into IDOL Server, the new setting applies only to new content, and the server logs a warning. To clear the warning and ensure that your change applies to all your content, you must initialize your index and reindex the content.