Convert Files

KeyView Export SDK enables you to convert many different types of documents to XML. Converting is the process of extracting the text from a document without the application-specific markup, and applying XML markup. The conversion process can also include the following:

  • Extracting subfiles to expose all subfiles for conversion. See Subfile Extraction.

  • Setting conversion options to determine the content, structure, and appearance of the XML output. See Set Conversion Options.

  • Extracting the file's format to detect a file's format, and report the information to the API, which in turn reports the information to the developer's application. See Extract File Format Information.

  • Extracting metadata (document properties) from a file. See Extract Metadata.

  • Converting character sets to control the character set of both the input and the output text. See Convert Character Sets.

  • Implementing callbacks to control the conversion while it is in progress. See Use Callbacks.

You can use one of the following methods to convert documents:

  • Use the Java implementation of the API. See Use the XML Export API and the Javadoc in the directory install\javaapi\javadoc, where install is the path name of the Export installation directory.

  • Use the Java sample programs. See Sample Programs.

    NOTE: OpenText strongly recommends that you convert documents out of process. During out-of-process conversion, Export runs independently from the calling application. Out-of-process conversions protects the stability of the calling application in the rare case when a malformed document causes Export to fail. Convert Files Out of Process.