Key to Supported Formats Table

The supported formats table includes the following information:

Column Description
Format Name

The format name that is returned by file format detection.

  • In the C API, these values are defined in the ENdocFmt enumeration in adDocFmt.h.
  • In the Java API these values are defined in the com.verity.api.DocFormat enumeration.
Number

The format number that is returned by file format detection. This is the value associated with the Format Name in the relevant enumeration.

The format number is used in the configuration file, formats_e.ini, to specify the reader to use to export the format.

Description A short description of the file format.
MIME Type The MIME type (if any).
Extension

A list of common file extensions for the file format.

NOTE: This is not a complete list of file extensions. File Content Extraction does not distinguish between file types based on their extension. Instead, it detects the file format based on the file content. This is more reliable because content cannot always be predicted from the file extension, and because some file extensions are associated with multiple formats.

Encrypted

Many file formats offer encryption to protect file content. This column specifies whether File Content Extraction can detect encrypted forms of the file format. If there is a check mark in this column, the encrypted property might be set in the results from format detection.

  • In the C API, see the ulAttributes member of the ADDOCINFO struct, and the kEncrypted enumerator of the ENDocAttributes enumeration.
  • In the Java API, see the isEncrypted() method of the DocFormatInfo class.
Metadata Specifies whether File Content Extraction can extract metadata from the file format.
File Class

The file class.

  • In the C API, these values are defined in the ENdocClass enumeration in adinfo.h.
  • In the Java API these values are defined in the com.verity.api.DocClass enumeration.