Optical Character Recognition

When processing raster image files, KeyView can perform Optical Character Recognition (OCR) to attempt to filter text that might be visible in the image. If text is detected to form part of a table, it will be filtered in the same way as tables in Word Processing documents.

NOTE: KeyView performs OCR only on standalone raster files, not on images embedded inside other documents. For embedded images, you must first extract the images by using the Extract Images option.

NOTE: OCR is available only on certain platforms (see Optical Character Recognition in the platform differences section).

If your license includes OCR, it is enabled by default.

To enable or disable OCR

  • Call the ocr method on your session configuration. For example:

    # To disable OCR...
    session.config.ocr(False)