Architectural Overview

The general architecture of the KeyView HTML conversion technology is the same across all supported platforms and is illustrated in the following diagram:

HTML Export Architecture

Each component is described in the following table.

Architectural Components

Component Description
Developer's Application The developer's application interfaces directly with the HTML Export API through either a C or Java implementation.
File Extraction API The File Extraction API opens a file and extracts the file's subfiles so that the subfiles are available for conversion. See Use the File Extraction API.
HTML Export API The HTML Export API exposes the functionality of HTML Export and controls all other HTML Export modules during the conversion process.
Format Detection Module The format detection module determines the file type of the source file, which enables the HTML Export interface to load the appropriate structured access layer module and document reader. See File Format Detection.
Structured Access Layer

The structured access layer contains three modules: one for word processing, one for spreadsheets, and one for presentations and graphics. Information from the format detection module determines which access layer module operates at this stage of the conversion. The structured access layer performs the following:

  1. Loads the appropriate document reader.

  2. Processes the data stream from the document reader.

  3. Determines table of contents entries.

  4. Sends the stream to the appropriate HTML writer.

  5. Accepts the HTML stream from the HTML writer.

  6. Generates the HTML output file with a table of contents, metadata, and the document's contents, and sends it to the HTML Export interface.

Document Reader Each document reader reads a specific file format and sends a text stream of the document to the structured access layer. Word processing readers return a token stream to the structured access layer. A token stream contains the document contents and messages (tokens) that precede the content and identify the type of information that follows them. Each reader is loaded as required by the structured access layer. See Document Readers and Writers for a complete list of document readers.
HTML Writers Each HTML writer accepts a text stream or token stream from the structured access layer and generates an equivalent HTML stream that is sent back to the structured access layer. The structured access layer then generates the output file. See Document Readers and Writers for a list of format writers.