CJKNormalization

This parameter allows you to specify how to normalize Chinese, Japanese, and Korean data before extraction, in all Eduction Server components.

You can specify the value of CJKNormalization as follows:

  • Kana. Half width kana to full width kana.
  • OldNew. Old kanji to new kanji.
  • Number. Chinese or kanji number characters to ASCII number characters.
  • HWNum. Full width number characters to ASCII number characters.
  • HWAlpha. Full width alphabet characters to ASCII alphabet characters.
  • SimpChi. Traditional Chinese to simplified Chinese.
  • FWJamo. Half width jamo to full width jamo.

Separate multiple options with a comma.

Action: EduceFromFile, EduceFromText, RedactFromFile, RedactFromText
Type: String
Default: The value of the CJKNormalization configuration parameter
Example: CJKNormalization=SimpChi,Kana
See Also: