The following table describes how the ProperNames
setting affects the terms that the IDOL Content component stores for the sentence Tom Jones And His greatest hits.
Original | Tom | Jones | And His | greatest | hits | |||
0
|
TOM
|
JONE
|
GREAT
|
HIT
|
||||
1
|
TOM
|
TOMJON
|
JONE
|
GREAT
|
HIT
|
|||
2
|
TOM
|
TOMJON
|
JONE
|
GREAT
|
GREATESTHIT
|
HIT
|
||
3
|
TOM
|
TOMJON
|
JONE
|
ANDHI
|
GREAT
|
HIT
|
||
4
|
TOM
|
TOMJON
|
JONE
|
JONESAND
|
ANDHI
|
GREAT
|
HIT
|
|
5
|
TOM
|
TOMJONES
|
JONE
|
ANDHIS
|
GREAT
|
HIT
|
||
6
|
TOM
|
TOMJONES
|
JONE
|
JONESAND
|
ANDHIS
|
GREAT
|
HIT
|
|
7
|
TOM
|
JONE
|
JONESAND
|
ANDHIS
|
GREAT
|
HIT
|
If the IDOL Content component contains these documents, the following queries produce different results according to your ProperNames
settings.
Doc 1: | Doc 2: | |
Tom Waits and The The in concert with Norah Jones | Tom Jones and the the in concert with Katie Melua |
action=Query&Text=Tom Jones
If you set ProperNames
to 0
or 7
, both documents return with the same relevance (in both cases, the query to Content has the terms TOM and JONE, which match both documents).
If you set ProperNames
to 1
, 2
, 3
, 4
, 5
, or 6
, Doc 2 returns with a higher relevance than Doc 1 (because it matches not just the terms TOM and JONE, but also TOMJON or TOMJONES).
action=Query&Text=tom jones
If you set ProperNames
to 0
, 1
, 3
, 4
, 5
, 6
, or 7
, both documents return with the same relevance (in both cases, the query to Content has the terms TOM and JONE, which match both documents).
If you set ProperNames
to 2
, Doc 2 returns with a higher relevance than Doc 1 (because it matches not just the terms TOM and JONE, but also TOMJON).
action=Query&Text=The The
If you set ProperNames
to 0
, 1
, or 2
, the query returns no results (because Content discards both instances of the word The as stop words).
If you set ProperNames
to 3
, 4
, 5
, 6
, or 7
, only Doc 1 returns (because in all these cases the query to Content has the term THETH or THETHE, which matches only Doc 1).
action=Query&Text=the the
If you set ProperNames
to 0
, 1
, 2
, 3
, 4
, 5
, 6
, or 7
, no results return (because Content discards both instances of the word the as stop words).
|