Process HTML

Connectors, including the IDOL Web Connector, can send documents that have associated HTML files to Connector Framework Server (CFS).

CFS can send the HTML files to KeyView, which discards the HTML markup and extracts the text that the files contain. However, HTML pages often contain invalid HTML and irrelevant content such as headers, sidebars, advertisements, and scripts. Text from this content carries no useful information and could pollute the IDOL index, degrading performance. KeyView does not remove this irrelevant content, so Connector Framework Server provides features to process HTML files.
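
The problem can be illustrated with a minimal sketch in Python. This is not how KeyView is implemented; it only assumes that a plain markup-stripping pass keeps every text node regardless of where it appears on the page. The class name, function name, and sample page are hypothetical and exist only for this illustration.

```python
from html.parser import HTMLParser


class NaiveTextExtractor(HTMLParser):
    """Hypothetical extractor: keeps every text node, whatever element it is in."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        # Keep any non-empty text, with no notion of "main content".
        text = data.strip()
        if text:
            self.chunks.append(text)


def extract_text(html):
    """Strip the markup and return all remaining text as one string."""
    parser = NaiveTextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


# Hypothetical page: one useful sentence surrounded by a sidebar and an advertisement.
SAMPLE_PAGE = """
<html><body>
<div id="sidebar">Home | Products | Contact</div>
<div id="ad">Buy one, get one free!</div>
<p>Quarterly revenue rose by four percent.</p>
</body></html>
"""

text = extract_text(SAMPLE_PAGE)
print(text)
```

The extracted string contains the sidebar and advertisement text alongside the one sentence worth indexing, which is the kind of pollution that processing the HTML before indexing is intended to avoid.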

