KeyView Viewing SDK

24.2.0

New in this Release

  • KeyView format detection has been extended, with support for 51 additional file formats. KeyView can now identify more than 2000 unique file formats. By identifying a larger range of formats present in the enterprise, decisions can be made on how to route, filter, or alert on such documents. For the full list, refer to the KeyView Viewing SDK Programming Guide.

  • Performance has been improved for large Rich Text Format (RTF) files.

  • KeyView can now present data from Pipe_Separated_Fmt files as a table.

  • The third-party libxml2 library has been updated to version 2.12.0.

  • The third-party zStandard library has been upgraded to version 1.5.5.
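
To illustrate what presenting pipe-separated data as a table involves, the following Python sketch splits each line on the pipe character and pads the columns so that they line up. It is not KeyView code and does not use the KeyView API. The function names parse_pipe_separated and render_table and the sample data are hypothetical, and the sketch assumes that fields contain no quoted or escaped pipe characters.

```python
def parse_pipe_separated(text):
    """Split pipe-separated text into a list of rows (lists of cell strings)."""
    rows = []
    for line in text.splitlines():
        if line.strip():  # skip blank lines
            rows.append([cell.strip() for cell in line.split("|")])
    return rows


def render_table(rows):
    """Pad every cell so that columns line up in plain-text output."""
    if not rows:
        return ""
    width = max(len(row) for row in rows)
    # Pad short rows with empty cells so every row has the same column count.
    padded = [row + [""] * (width - len(row)) for row in rows]
    col_widths = [max(len(row[i]) for row in padded) for i in range(width)]
    lines = []
    for row in padded:
        cells = (cell.ljust(col_widths[i]) for i, cell in enumerate(row))
        lines.append("  ".join(cells).rstrip())
    return "\n".join(lines)


# Hypothetical sample data; the last row is deliberately one cell short.
sample = "id|name|size\n1|report.rtf|2048\n2|notes.txt"
table = parse_pipe_separated(sample)
print(render_table(table))
```

Short rows are padded with empty cells rather than rejected, which keeps the column count stable for whatever consumes the table.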

Resolved Issues

  • (Security update) The third-party OpenSSL library has been upgraded to version 3.2.1 to resolve potential security vulnerabilities, including CVE-2023-4807, CVE-2023-5363, CVE-2023-5678, CVE-2023-6237, and CVE-2024-0727.

  • (Security update) The third-party sqlite library has been upgraded to version 3.45.1 to resolve known vulnerabilities, including CVE-2023-7104.

  • (Security update) The third-party libde265 library has been upgraded to version 1.0.15 to resolve known vulnerabilities, including CVE-2023-27102, CVE-2023-27103, CVE-2023-49465, CVE-2023-49467, and CVE-2023-49468.

  • (Security update) The third-party libheif library has been upgraded to version 1.17.6 to resolve known vulnerabilities, including CVE-2023-49462 and CVE-2023-49463.

  • (Security update) The third-party XMPToolkit library has been upgraded to version 2023.12 to resolve known vulnerabilities.

  • (Security update) The third-party expat library has been upgraded to version 2.6.0 to resolve known vulnerabilities, including CVE-2023-52425 and CVE-2023-52426.

  • When processing Microsoft Word (docx) files that contained charts, KeyView sometimes left files in the temporary directory.

  • KeyView sometimes output some nonsense text when processing the headers and footers of Microsoft Word documents (.doc).

  • When a PDF had a non-valid subform, it could lead to a bad input stream error for the whole file. KeyView now skips the non-valid subform and attempts to retrieve information from the rest of the file.

Notes

  • KeyView 24.2 no longer includes the deprecated cebsr and lwpsr readers.

24.1.0

New in this Release

  • KeyView format detection has been extended, with support for 40 additional file formats. By identifying a larger range of formats present in the enterprise, decisions can be made on how to route, filter, or alert on such documents. For the full list, refer to the KeyView Viewing SDK Programming Guide.

  • KeyView now produces tabular output for TSV files.

  • KeyView output now includes URLs in OpenDocument Spreadsheet (.ODS) documents and outputs them as hidden text.

  • Text-only support has been added for the RSS syndication XML format (RSS_Fmt).

  • When configured to display formula strings, KeyView now supports many more formulas from Microsoft Excel (.xls) files.

  • The third-party libpng library was upgraded from version 1.6.37 to version 1.6.40.

  • The third-party Lib ICU library was upgraded to version 73.2.

  • The third-party zlib library was upgraded from version 1.2.13 to version 1.3.
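
The format detection items in these notes mention routing, filtering, or alerting on documents according to their detected format. The following Python sketch shows one way that decision could look once a format name is available. It does not call KeyView: the ROUTES table, the action names, and route_document are hypothetical, and the sketch assumes that a separate detection step has already produced the format name as a string. The format names themselves are ones that appear in these notes.

```python
# Hypothetical routing table: detected format name -> action to take.
ROUTES = {
    "PDF_Fmt": "index",
    "RSS_Fmt": "index",
    "Z7Z_Fmt": "unpack",
    "RMS_Protected_Fmt": "alert",
}

# Assumption: formats with no explicit rule are held for manual review.
DEFAULT_ACTION = "quarantine"


def route_document(format_name):
    """Return the action configured for a detected format name."""
    return ROUTES.get(format_name, DEFAULT_ACTION)


for name in ("PDF_Fmt", "RMS_Protected_Fmt", "Unknown_Fmt"):
    print(name, "->", route_document(name))
```

Using an explicit default action means that a newly detected format is handled predictably even before a rule is written for it.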

Resolved Issues

  • (Security update) The third-party OpenSSL library was updated to version 3.1.4.

  • (Security update) A potential security vulnerability was resolved for processing WebP_Fmt files or TIFF_Fmt files that use WebP compression in the Viewing SDK. This change addresses CVE-2023-4863.

  • (Security update) The assr, awsr, kpcgmrdr, and orcsr readers had potential buffer overruns.

  • Processing some valid ODS files failed with the error KVError_ArchiveFatalError.

  • KeyView did not use consistent naming conventions for mail subfiles across different file formats.

  • On non-Windows platforms, KeyView reported invalid bytes at the end of subfile names for some ZIP-based formats, such as JAR files.

  • MBox (.mbx) files that were signed but not encrypted returned KVError_PasswordProtected.

  • KeyView could identify some files of type GDSII as MacWrite format.

23.4.0

New in this Release

  • KeyView format detection has been extended, with support for 48 additional file formats. By identifying a larger range of formats present in the enterprise, decisions can be made on how to route, filter, or alert on such documents. For the full list, refer to the KeyView Viewing SDK Programming Guide.

  • KeyView now supports Microsoft Word documents with Landscape pages.

  • KeyView now supports metadata extraction for current (2013 to the present day) Apple iWork Pages, Numbers and Keynote files.

  • Text-only viewing support has been added for the ZIP variant of Uniform Office Format word processor files (Uniform_Office_Text_Zip_Fmt).

  • The PDF reader (pdfsr) has been improved for Arabic text in some circumstances.

    When you use Microsoft Print to PDF to convert Word documents that contain Arabic text in certain fonts to PDF, the resulting file is often incomplete, missing character mapping information that is required to interpret the text content. The pdfsr reader was previously able to reconstruct this missing information for most cases where the text was in Calibri font. The pdfsr reader can now also reconstruct the information for Sakkal Majalla font, and for additional Calibri cases.

    Furthermore, pdfsr now attempts to reconstruct the information when a character is mapped to the Unicode replacement character codepoint, rather than only when a character's mapping information is missing entirely.

  • KeyView can now extract text from some PDF documents that contain Type 3 fonts without Unicode character mapping information, rather than always converting all characters in those fonts to the configured replacement character. This process is not guaranteed to result in readable output for all such files.

  • The third-party openssl library has been upgraded to version 3.1.2.

  • (Security update) The third-party libwebp library has been upgraded to version 1.3.1.

  • (Security update) The third-party libxml2 library has been upgraded to version 2.11.4.

  • (Security update) The third-party freetype library has been upgraded to version 2.13.1.

  • The third-party libheif library has been upgraded to version 1.16.2.
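
The pdfsr item above describes treating a character that is mapped to the Unicode replacement character (U+FFFD) in the same way as a character with no mapping at all. The following Python sketch illustrates only that check. It does not show how pdfsr reconstructs the missing information, and the to_unicode dictionary, the glyph codes, and the needs_reconstruction function are hypothetical.

```python
REPLACEMENT_CHAR = "\ufffd"  # U+FFFD REPLACEMENT CHARACTER


def needs_reconstruction(glyph_code, to_unicode):
    """Return True if the glyph has no usable Unicode mapping."""
    mapped = to_unicode.get(glyph_code)
    # Treat both a missing entry and a mapping to U+FFFD as unusable.
    return mapped is None or mapped == REPLACEMENT_CHAR


# Hypothetical glyph-code-to-Unicode map for one font.
to_unicode = {1: "\u0633", 2: REPLACEMENT_CHAR}
unmapped = [code for code in (1, 2, 3) if needs_reconstruction(code, to_unicode)]
print(unmapped)  # glyph 2 maps to U+FFFD and glyph 3 has no entry
```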

Resolved Issues

  • When viewing landscape pages in Microsoft Word documents, part of the page was hidden by the right-hand side page border, if configured.

  • When viewing Microsoft Word documents that contained tables, KeyView sometimes displayed an incorrect number of pages.

  • On macOS, KeyView included a version of libxml2.dylib, which could make it difficult for calling applications to use their own or the system version of libxml2.dylib.

  • KeyView did not set the KVMainFileInfoFlag_HasContent flag for pFiles (RMS_Protected_Fmt).

  • KeyView sometimes failed to output the text from some cells in tables in current (2013 to the present day) Apple iWork Numbers and Keynote files.

  • KeyView could exit unexpectedly (in-process) or return an error (out-of-process) when extracting from certain MIME email files.

  • KeyView could exit unexpectedly (in-process) or return an error (out-of-process) when processing certain PDF and ODS files.

  • The supplied default configuration for the Viewing SDK did not allow you to view 7-Zip archive (Z7Z_Fmt) files.

  • (Security update) The third-party ODA library was upgraded to 2024.7 on Windows and Linux platforms, and 2024.1 on Mac platforms to resolve known vulnerability CVE-2023-26495.

  • KeyView leaked memory when using the text member of Document objects in the C++ and Python APIs.

  • KeyView returned an error when retrieving headers and footers from certain MS Word (docx) documents.

  • KeyView returned an error when processing PDF documents containing encrypted forms.

  • KeyView returned an error when processing certain PDF documents.

  • KeyView sometimes included duplicate text in the output when PDF documents contained the same text multiple times in the same location.

  • When viewing Excel Binary File Format (.xls) files, KeyView did not display the contents of some cells in lower rows of the spreadsheet.

  • The Viewing SDK was unable to view some AutoDesk AutoCAD Drawing file (AutoDesk_DWG_Fmt) documents.

  • KeyView sometimes exited unexpectedly while shutting down the kpodardr library when using the Viewing SDK on some AutoDesk AutoCAD Drawing file (AutoDesk_DWG_Fmt) documents.

23.3.0

New in this Release

  • KeyView format detection has been extended, with support for 61 additional file formats. By identifying a larger range of formats present in the enterprise, decisions can be made on how to route, filter, or alert on such documents. For the full list, refer to the KeyView Viewing SDK Programming Guide.

  • The third-party sqlite library was upgraded to version 3.42.0.

Resolved Issues

  • (Security update) KeyView depended on outdated LZMA code for detection of 7z encryption. This dependence has been removed.

  • When using the pdfsr reader to process PDFs that contained fonts with certain predefined encodings, spaces sometimes appeared in the wrong places in output text.

  • When using the pdfsr reader to process PDFs, output text was sometimes broken onto additional new lines or missed characters.

  • KeyView leaked memory when filtering some Apple iWork Keynote (.key) (IWPG_Fmt) files.

  • KeyView incorrectly identified some attachments to Outlook .msg files as being inline pictures rather than non-inline attachments.

  • Changing the global locale could cause KeyView to fail to process some file formats.

  • When attempting to filter NIST_ITL_Fmt files, KeyView returned KVERR_FormatNotSupported.

  • Some child windows were not displayed in the viewing window. For example, this issue meant that mail bodies were not displayed and attachments could not be opened.

  • When viewing Microsoft Excel documents (XLSX) that contained merged cells, KeyView could exit unexpectedly.

  • When processing HTML files that specify an ISO-2022 character encoding, KeyView output some entities as unprocessed strings, rather than the correct Unicode character.
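
The last item in this list concerns HTML entities that were output as unprocessed strings. As a generic illustration of the difference between an unprocessed entity and the character it stands for, the following Python sketch uses the standard library function html.unescape. It is unrelated to the KeyView implementation, and the sample string is hypothetical.

```python
import html

# Named, predefined, and numeric entities in one hypothetical sample string.
raw = "caf&eacute; &amp; cr&#232;me"

# html.unescape replaces each entity with the Unicode character it denotes.
decoded = html.unescape(raw)
print(decoded)
```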

23.2.0

New in this Release

  • KeyView format detection has been extended, with support for 56 additional file formats. By identifying a larger range of formats present in the enterprise, decisions can be made on how to route, filter, or alert on such documents. For the full list, refer to the KeyView Viewing SDK Programming Guide.

  • KeyView has been simplified so that it is much easier to map file formats to readers. File formats no longer have an associated "category". The KeyView configuration files such as formats.ini, formats_e.ini, and kvsdk.ini now identify file formats using the same file format numbers that are returned by format detection.

  • KeyView can now extract all platform-specific embedded files from PDF_Fmt documents.

  • KeyView can now process certain OpenOffice Text files that it would previously reject, and it can process image alt text in OpenOffice Text as hidden text.

  • KeyView now supports TIFF (TIFF_Fmt) files that use WebP compression.

  • Handling of Arabic diacritics (tashkil) has been significantly improved when using the pdfsr reader to process PDFs.

  • Text ordering has been improved when using the pdfsr reader to process PDFs.

  • The FreeType third-party library has been upgraded to version 2.12.1.

  • The ODA third-party library has been upgraded to version 2023.12.

  • The zlib third-party library has been upgraded to version 1.2.13.

  • The libxml2 third-party library has been upgraded to version 2.10.3.

  • The expat third-party library has been upgraded to version 2.5.0.

  • The ICU third-party library has been upgraded to version 72.1.

  • The openssl third-party library has been upgraded to version 3.0.8.

  • The libde265 third-party library has been upgraded to version 1.0.11.

  • The XMP-Toolkit third-party library has been upgraded to version 2022.06.

  • The wavpack third-party library has been upgraded to version 5.6.0.

  • The sqlite third-party library has been upgraded to version 3.41.0.

  • The third-party libical library was upgraded to version 3.0.16.

  • The third-party Apache Arrow library was upgraded to version 11.0.0. This change includes upgrades to the following dependencies: 

    • boost was upgraded to version 1.75.0

    • brotli was upgraded to version 1.0.9

    • jemalloc was upgraded to version 5.3.0

    • re2 was upgraded to version 2022-06-01

    • thrift was upgraded to version 0.16.0

    • utf8proc was upgraded to version 2.7.0

    • zStandard was upgraded to version 1.5.2

Resolved Issues

  • (Security update) The third-party libtiff library has been upgraded to version 4.5.0 to resolve known vulnerabilities, including CVE-2022-2056, CVE-2022-2057, CVE-2022-2058, CVE-2022-3452, CVE-2022-3570, CVE-2022-3597, CVE-2022-3598, CVE-2022-3599, CVE-2022-3626, and CVE-2022-3627.

  • (Security update) The third-party protobuf library has been upgraded to version 3.21.12 to resolve known vulnerabilities, including CVE-2022-1941.

  • (Security update) The libjpeg third-party library has been upgraded to version 9e to resolve potential vulnerabilities.

  • (Security update) The libwebp third-party library has been upgraded to version 1.3.0.

  • KeyView could truncate long sections of text in PDF_Fmt documents.

  • KeyView did not retrieve the Image Width, Image Height and Bits Per Pixel in summary information from Tagged Image File Format (TIFF) TIFF_Fmt files.

  • KeyView did not view TIFF documents with images in multiple colourspaces correctly.

  • For some Microsoft Excel (XLSX) files with many cells that use Rich Data Types, KeyView output the names of those types incorrectly, using a number instead of a type name.

  • In word processor formats such as Microsoft Word (DOCX), embedded images with text wrapping around them were sometimes rendered above the text, obstructing it. To avoid this, the Viewing SDK now places images at the start of paragraphs to ensure that the text is readable, making the output more consistent with HTML Export.

  • KeyView could skip some user defined properties in summary information for some OLE-based files like MS_Project_2007_Fmt.

  • When using the pdfsr reader for text in right-to-left languages, diacritic characters were sometimes not extracted correctly.

  • Some PDF files took longer to process in version 12.13.0 of the SDK than in version 12.12.0.

  • KeyView could exit unexpectedly when processing some Microsoft Visio (.vsd) files.

  • KeyView missed text from some Microsoft Visio (.vsd) files.

  • Some base-64 encoded attachments to ICS files were extracted incorrectly.

  • KeyView could report duplicate metadata from Tagged Image File Format (TIFF) files with multiple pages.

  • KeyView could output incorrect metadata names for some PDF files.

  • KeyView could omit metadata entries for some PDF files.

  • When using the pdfsr reader to process PDFs that contained right-to-left (RTL) text, some text at the top of the file was not included in the output.

  • KeyView could process some CSV files incorrectly, meaning fields were output in the wrong columns.

  • The reader kpPDF2rdr was not thread-safe.

  • The reader kpPDF2rdr failed to release memory each time a file was processed.

  • The reader kpPDF2rdr could output incorrect values for page width and height.