Language Types

Each document that you want to index into the Content component has a language and an encoding. In Content, the combination of the language and encoding is the Language Type.

All indexed documents have a language type.

  • When you index a document, you can:

  • Specify the language type that Content must use.

  • Configure Content to automatically detect the language type.

Specify the Language Type

When you create documents, you can add a field to the document that contains the language type. You can then configure Content to read the value of this field and use it to define the language type.

Alternatively, when you send an document to Content for indexing, you can provide the language type in an action parameter. Content applies this language type to all the documents included in the index action.

Automatic Language Detection

When you turn on Automatic Language Detection, Content automatically finds the language and encoding of a document, and assigns it to the appropriate language type.

With Automatic Language Detection, you can also configure the language type that Content must use if:

See the main topic on Automatic Language Detection.