KeyView Export SDK

23.2.0

New in this Release

  • KeyView format detection has been extended, with support for 56 additional file formats. Identifying a larger range of the formats present in the enterprise lets you make better decisions about how to route, filter, or alert on such documents. For the full list, refer to the KeyView Export SDK Programming Guides.

  • KeyView 23.2 introduces a new metadata API. The new API:

    • reduces the number of function calls you need to make to retrieve all metadata.
    • performs field standardization. Field standardization returns metadata using a standard set of field names, so that the same metadata is returned in the same field regardless of the source file format. The new metadata API allows for the introduction of further standardization, in future releases, without breaking backwards compatibility.

    The new metadata API is available in KeyView Filter (C and C++). The new metadata API can also be used through KeyView Export (C), but only when extracting subfile metadata.

  • Error reporting has been simplified in the C API. In earlier versions of KeyView, some functions could return the error code KVERR_General. You could then call fpGetKvErrorCodeEx() to obtain an "extended" error code. In KeyView 23.2 the error codes have been unified, so that all error codes are included in the KVErrorCode enumeration. If a function returns an error code, there is no need to call a second function to obtain more information. This makes it easier to handle errors when an operation fails.

  • KeyView has been simplified so that it is much easier to map file formats to readers. File formats no longer have an associated "category". The KeyView configuration files such as formats.ini, formats_e.ini, and kvsdk.ini now identify file formats using the same file format numbers that are returned by format detection.

    For example, when KeyView detects an Adobe PDF file it returns format number 230. Imagine that you want to process PDF files with the reader pdf2sr.

    In previous versions of KeyView you had to find the associated format category (200) and use this to configure KeyView:

    200=pdf2

    In KeyView 23.2, this is no longer necessary and you instead use the same format number that is returned from format detection:

    230=pdf2

  • KeyView can now extract all platform-specific embedded files from PDF_Fmt documents.

  • When you enable 'show hidden text', KeyView can now output author names for comments in Rich Text Format (MS_RTF_Fmt) documents.

  • KeyView can now process certain OpenOffice Text files that it would previously reject, and it can process image alt text in OpenOffice Text as hidden text.

  • When getting mail metadata from EML subfiles, KeyView now reports the sent date as a date instead of a string.

  • When getting subfile information from EML subfiles, KeyView now converts the file time to UTC, rather than an unspecified time zone.

  • KeyView now supports TIFF (TIFF_Fmt) files that use WebP compression.

  • Handling of Arabic diacritics (tashkil) has been significantly improved when using the pdfsr reader to process PDFs.

  • Text ordering has been improved when using the pdfsr reader to process PDFs.

  • The FreeType third-party library has been upgraded to version 2.12.1.

  • The ODA third-party library has been upgraded to version 2023.12.

  • The zlib third-party library has been upgraded to version 1.2.13.

  • The libxml2 third-party library has been upgraded to version 2.10.3.

  • The expat third-party library has been upgraded to version 2.5.0.

  • The ICU third-party library has been upgraded to version 72.1.

  • The openssl third-party library has been upgraded to version 3.0.8.

  • The libde265 third-party library has been upgraded to version 1.0.11.

  • The XMP-Toolkit third-party library has been upgraded to version 2022.06.

  • The wavpack third-party library has been upgraded to version 5.6.0.

  • The sqlite third-party library has been upgraded to version 3.41.0.
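The unified error reporting described earlier in this section can be illustrated with a minimal sketch. It does not use the KeyView headers: the DemoErrorCode enumeration and the demo_open_file, demo_describe, and demo_process functions are hypothetical stand-ins, and their names and values are assumptions made for illustration, not the real KVErrorCode values or KeyView functions. The point is only the calling pattern: the function returns the specific error directly, so the caller inspects one return value instead of making a second call for an extended code.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical stand-in for a unified error enumeration. These names and
 * values are illustrative only; they are not the real KVErrorCode values. */
typedef enum {
    DEMO_Success = 0,
    DEMO_General,
    DEMO_PasswordProtected,
    DEMO_BadInputStream
} DemoErrorCode;

/* Hypothetical operation that reports the specific failure directly,
 * rather than a general error plus a separate "extended" code. */
static DemoErrorCode demo_open_file(const char *path)
{
    assert(path != NULL);
    if (path[0] == '\0')
        return DEMO_BadInputStream;
    if (strstr(path, "protected") != NULL)
        return DEMO_PasswordProtected;
    return DEMO_Success;
}

/* Map each code to a short message in a single switch. */
static const char *demo_describe(DemoErrorCode code)
{
    switch (code) {
    case DEMO_Success:           return "success";
    case DEMO_PasswordProtected: return "password protected";
    case DEMO_BadInputStream:    return "bad input stream";
    case DEMO_General:           /* fall through */
    default:                     return "general error";
    }
}

/* Returns 0 on success and 1 on failure, logging the failure to stderr. */
static int demo_process(const char *path)
{
    DemoErrorCode rc = demo_open_file(path);
    if (rc != DEMO_Success) {
        fprintf(stderr, "%s: %s\n", path, demo_describe(rc));
        return 1;
    }
    return 0;
}
```

In this sketch, calling demo_process("protected.odt") logs the specific reason and returns 1 without any follow-up call. In real application code, the equivalent check would be made against the values of the KVErrorCode enumeration documented in the programming guides.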

Resolved Issues

  • (Security update) The third-party libtiff library has been upgraded to version 4.5.0 to resolve known vulnerabilities, including CVE-2022-2056, CVE-2022-2057, CVE-2022-2058, CVE-2022-3452, CVE-2022-3570, CVE-2022-3597, CVE-2022-3598, CVE-2022-3599, CVE-2022-3626, and CVE-2022-3627.

  • (Security update) The third-party protobuf library has been upgraded to version 3.21.12 to resolve known vulnerabilities, including CVE-2022-1941.

  • (Security update) The libjpeg third-party library has been upgraded to version 9e to resolve potential vulnerabilities.

  • (Security update) The libwebp third-party library has been upgraded to version 1.3.0.

  • For password-protected OpenOffice files (ODS, ODT, and ODP), fpOpenFile did not return KVERR_PasswordProtected. Continuing with extraction could then result in invalid extracted files.

  • KeyView could truncate long sections of text in PDF_Fmt documents.

  • KeyView did not retrieve the Image Width, Image Height, and Bits Per Pixel in summary information from Tagged Image File Format (TIFF_Fmt) files.

  • For some Microsoft Excel (XLSX) files with many cells that use Rich Data Types, KeyView output the names of those types incorrectly, using a number instead of a type name.

  • Some C sample programs could loop endlessly when a bad argument was passed in.

  • KeyView could skip some user-defined properties in summary information for some OLE-based files, such as MS_Project_2007_Fmt.

  • When using the pdfsr reader for text in right-to-left languages, diacritic characters were sometimes not extracted correctly.

  • KeyView did not extract all the images from some Rich Text Format (MS_RTF_Fmt) documents.

  • The fpGetSubFileInfo function in the extraction API did not correctly report the sizes of subfiles when they were larger than 2 GB.

  • Some PDF files took longer to process in version 12.13.0 of the SDK than in version 12.12.0.

  • HEIC and HEIF format documents could not be processed on macOS.

  • KeyView could return an error (out-of-process), or exit unexpectedly (in-process) when processing some Microsoft Visio (.vsd) files.

  • KeyView missed text from some Microsoft Visio (.vsd) files.

  • Some base-64 encoded attachments to ICS files were extracted incorrectly.

  • When converting spreadsheet files to HTML, KeyView removed all empty rows even when bRemoveEmptyRows was set to FALSE.

  • KeyView could report duplicate metadata from Tagged Image File Format (TIFF) files with multiple pages.

  • When using kvhtmlexport to export container files, internal container subfile pages were misnamed as subfilen.temp.

  • KeyView could output incorrect metadata names for some PDF files.

  • KeyView could omit metadata entries for some PDF files.

  • KeyView could be slow to start the out-of-process session if called on more threads than there were ports configured.

  • When using the pdfsr reader to process PDFs that contained right-to-left (RTL) text, some text at the top of the file was not included in the output.

  • KeyView could process some CSV files incorrectly, so that fields were output in the wrong columns.

  • KeyView would terminate unexpectedly (in-process), or return an error (out-of-process), when processing a PDF document with the reader kpPDF2rdr, if the input was an input stream that was not created by KeyView.

  • The reader kpPDF2rdr was not thread-safe.

  • The reader kpPDF2rdr failed to release memory each time a file was processed.

  • The reader kpPDF2rdr could output incorrect values for page width and height.

Notes

KeyView 23.2 is a new major version of IDOL, released in the second quarter of 2023. It is the first new major version since KeyView 12.0 was released in June 2018. KeyView 23.2 includes some changes that require you to update your license and application code. For more information about how to upgrade, see the KeyView upgrade guide.

Deprecated Features

The following features are deprecated and might be removed in a future release.

Category: Readers

Deprecated Feature: The following readers have been deprecated:

  • cebsr

  • lwpsr

Deprecated Since: 23.2.0

Requirements

For information about supported platforms, supported compilers, and software dependencies for the KeyView Export SDK, refer to the KeyView Export SDK Programming Guides.

Documentation

The following documentation is available for KeyView Export SDK version 23.2.0.

  • KeyView Export SDK: HTML Export C Programming Guide

  • KeyView Export SDK: HTML Export Java Programming Guide

  • KeyView Export SDK: PDF Export C Programming Guide

  • KeyView Export SDK: XML Export C Programming Guide

  • KeyView Export SDK: XML Export Java Programming Guide