Example Proper Name Searches

The following table describes how the ProperNames setting affects the terms that the IDOL Content component stores for the sentence Tom Jones And His greatest hits.

Original Tom   Jones   And His greatest   hits
0 TOM   JONE     GREAT   HIT
1 TOM TOMJON JONE     GREAT   HIT
2 TOM TOMJON JONE     GREAT GREATESTHIT HIT
3 TOM TOMJON JONE   ANDHI GREAT   HIT
4 TOM TOMJON JONE JONESAND ANDHI GREAT   HIT
5 TOM TOMJONES JONE   ANDHIS GREAT   HIT
6 TOM TOMJONES JONE JONESAND ANDHIS GREAT   HIT
7 TOM   JONE JONESAND ANDHIS GREAT   HIT

If the IDOL Content component contains these documents, the following queries produce different results according to your ProperNames settings.

Doc 1:   Doc 2:
Tom Waits and The The in concert with Norah Jones   Tom Jones and the the in concert with Katie Melua
  • action=Query&Text=Tom Jones

    If you set ProperNames to 0 or 7, both documents return with the same relevance (in both cases, the query to Content has the terms TOM and JONE, which match both documents).

    If you set ProperNames to 1, 2, 3, 4, 5, or 6, Doc 2 returns with a higher relevance than Doc 1 (because it matches not just the terms TOM and JONE, but also TOMJON or TOMJONES).

  • action=Query&Text=tom jones

    If you set ProperNames to 0, 1, 3, 4, 5, 6, or 7, both documents return with the same relevance (in both cases, the query to Content has the terms TOM and JONE, which match both documents).

    If you set ProperNames to 2, Doc 2 returns with a higher relevance than Doc 1 (because it matches not just the terms TOM and JONE, but also TOMJON).

  • action=Query&Text=The The

    If you set ProperNames to 0, 1, or 2, the query returns no results (because Content discards both instances of the word The as stop words).

    If you set ProperNames to 3, 4, 5, 6, or 7, only Doc 1 returns (because in all these cases the query to Content has the term THETH or THETHE, which matches only Doc 1).

  • action=Query&Text=the the

    If you set ProperNames to 0, 1, 2, 3, 4, 5, 6, or 7, no results return (because Content discards both instances of the word the as stop words).