Customize Character Set Detection and Conversion
By default, KeyView attempts to detect the character set of an input file. Core KeyView functionality can detect some character sets (including ANSI, UTF-8, and UTF-16), but others can be detected only if your license includes advanced character set detection.
If your license includes advanced character set detection, it is enabled by default. However, it can increase the time required to filter some documents. You can disable advanced character set detection on a file-by-file basis by calling fpFilterConfig() and setting the flag KVFLT_CHARSETDETECTION to FALSE. Before setting this flag, be aware that KeyView cannot perform character set conversion unless it detects the character set of the source file, or you call fpSetSrcCharSet().
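The following is a minimal sketch of the call described above, not a definitive implementation. So that it compiles on its own, it declares stand-ins instead of including the KeyView headers: the DemoFilterFuncs struct, the demo_filter_config() stub, the parameter list shown for fpFilterConfig(), and the numeric value given to KVFLT_CHARSETDETECTION are all assumptions made for illustration. In real code, take the declarations and constants from the SDK headers.

```c
#include <assert.h>
#include <stddef.h>

/* Stand-ins so the sketch compiles alone; real code uses the SDK headers. */
#ifndef TRUE
#define TRUE  1
#define FALSE 0
#endif
typedef int BOOL;

/* Placeholder value, NOT the real constant from the KeyView headers. */
#define KVFLT_CHARSETDETECTION 1

/* Hypothetical, reduced view of the Filter function table.
   The parameter list is an assumption; check it against the SDK. */
typedef struct {
    BOOL (*fpFilterConfig)(void *pContext, int nType, int nValue, void *pData);
} DemoFilterFuncs;

/* Demo stub that only records what it was asked to configure. */
static int g_last_type  = -1;
static int g_last_value = -1;

static BOOL demo_filter_config(void *pContext, int nType, int nValue, void *pData)
{
    (void)pContext;
    (void)pData;
    g_last_type  = nType;
    g_last_value = nValue;
    return TRUE;
}

/* Turn advanced character set detection off for the current file. */
static BOOL disable_charset_detection(const DemoFilterFuncs *filter, void *pContext)
{
    assert(filter != NULL && filter->fpFilterConfig != NULL);
    return filter->fpFilterConfig(pContext, KVFLT_CHARSETDETECTION, FALSE, NULL);
}
```

Because the setting applies per file, a caller would typically make this call only for files whose encoding is already known, and supply that encoding with fpSetSrcCharSet() so that conversion can still take place.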
When the character set of the input file is known, KeyView performs character set conversion. By default, it converts text to the operating system code page. You can prevent this default conversion and specify that Filter retain the original character encoding of the document. Any document identified as containing more than one character encoding is converted to the first encoding encountered in the file.
To prevent the default conversion, set the flag KVF_NODEFAULTCHARSETCONVERT as the last argument of the call to fpInit() or fpInitWithLicenseData(). This setting overrides any source or target character set specified through the API.
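As a rough illustration of passing the flag at initialization, the sketch below combines KVF_NODEFAULTCHARSETCONVERT with other flags and hands the result to an initialization function as its last argument. Only the position of the flags argument comes from the text above. The DemoInitFn type, the demo_init() stub, DEMO_OTHER_INIT_FLAG, the remaining parameters, and the numeric flag values are hypothetical stand-ins so that the example compiles without the KeyView headers.

```c
#include <assert.h>
#include <stddef.h>

/* Placeholder bit values, NOT the real constants from the KeyView headers. */
#define KVF_NODEFAULTCHARSETCONVERT 0x1UL
#define DEMO_OTHER_INIT_FLAG        0x2UL  /* hypothetical second flag */

/* Hypothetical, simplified stand-in for the initialization entry point.
   The only detail taken from the text is that the flags come last. */
typedef void *(*DemoInitFn)(void *pMemAllocator, const char *pszKeyViewDir,
                            int outputCharSet, unsigned long dwFlags);

/* Demo stub that records the flags it receives and returns a dummy context. */
static unsigned long g_seen_flags   = 0;
static int           g_demo_context = 0;

static void *demo_init(void *pMemAllocator, const char *pszKeyViewDir,
                       int outputCharSet, unsigned long dwFlags)
{
    (void)pMemAllocator;
    (void)pszKeyViewDir;
    (void)outputCharSet;
    g_seen_flags = dwFlags;
    return &g_demo_context;
}

/* Initialize a session that keeps each document's original encoding. */
static void *init_keeping_source_encoding(DemoInitFn init,
                                          const char *pszKeyViewDir,
                                          unsigned long otherFlags)
{
    assert(init != NULL);
    /* The target character set (0 here) is a placeholder: per the text,
       the flag overrides any source or target character set. */
    return init(NULL, pszKeyViewDir, 0,
                otherFlags | KVF_NODEFAULTCHARSETCONVERT);
}
```

Combining the flags with a bitwise OR leaves any other initialization flags intact while adding the one that suppresses the default conversion.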