HyphenChars
A list of the characters (in unencoded UTF-8) to use as hyphens. Set HyphenChars to NONE if you do not want to use any characters as hyphens.
IDOL Content Component treats hyphenated words specially when it indexes them, and indexes both the subparts and the whole hyphenated word. For example, if HyphenChars is set to - (the default value), IDOL Content Component indexes second-hand as second, hand, and secondhand, and you can query all three terms.
At query time, IDOL Content Component tokenizes the hyphenated term as the whole term and individual subterms, all separated by the configured DefaultQueryOperator. For example, if the DefaultQueryOperator is OR (the default value), a query for second-hand becomes secondhand OR second OR hand.
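The indexing and query-time behavior described above can be illustrated with a small Python sketch. This is not IDOL Content Component code, only a toy model under stated assumptions: the function names (split_hyphenated, index_terms, query_expansion) are hypothetical, an empty hyphen_chars string stands in for HyphenChars=NONE, and the handling of terms with more than one hyphen is a guess that the documentation does not confirm.

```python
import re


def split_hyphenated(term, hyphen_chars="-"):
    """Split a term on any of the given hyphen characters (toy model)."""
    if not hyphen_chars:
        # Assumption: an empty string models HyphenChars=NONE.
        return [term]
    pattern = "[" + re.escape(hyphen_chars) + "]"
    # Drop empty pieces produced by leading, trailing, or doubled hyphens.
    return [part for part in re.split(pattern, term) if part]


def index_terms(term, hyphen_chars="-"):
    """Return the subparts followed by the joined whole term."""
    parts = split_hyphenated(term, hyphen_chars)
    if len(parts) < 2:
        return parts
    return parts + ["".join(parts)]


def query_expansion(term, operator="OR", hyphen_chars="-"):
    """Join the whole term and its subparts with the query operator."""
    parts = split_hyphenated(term, hyphen_chars)
    if len(parts) < 2:
        return "".join(parts)
    return f" {operator} ".join(["".join(parts)] + parts)


print(index_terms("second-hand"))
print(query_expansion("second-hand"))
print(query_expansion("second-hand", operator="AND"))
```

Under these assumptions, the last call shows why changing the operator matters: with AND, the expanded query requires the whole term and every subterm to match.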
NOTE: If you change the DefaultQueryOperator, queries for hyphenated terms might not return the results that you expect, because the same operator also joins the whole term and its subterms.
The separate parts of a hyphenated term (such as second and hand) are available only for basic keyword searches, and they do not match proximity expressions or field-restricted searches. For example, if a document contains the phrase second-hand car, a keyword search for hand matches the document, but an exact phrase search for "hand car" does not.
In most situations, OpenText recommends that you set HyphenChars to NONE, and use the hyphen as a separator (for example, set AugmentSeparators to -). For more information, refer to IDOL Expert.
Type: | String |
Default: | - |
Required: | No |
Configuration Section: | LanguageTypes or MyLanguage |
Example: | HyphenChars=-/. |
See Also: | AugmentSeparators, TangibleCharacters |
NOTE: If you change this setting after you have indexed content into IDOL Server, the new setting applies only to new content, and the server logs a warning. To clear the warning and ensure that your change applies to all your content, you must initialize your index and reindex the content.