Connectors, including the IDOL Web Connector, can send documents to CFS that have associated HTML files.
CFS can send the HTML files to KeyView, which discards the HTML markup and extracts the text contained in the file. However, HTML pages often contain irrelevant content such as invalid HTML, headers, sidebars, advertisements, and scripts. This text does not contain any useful information and could pollute the IDOL index, degrading performance. KeyView does not remove this irrelevant content, so HPE Connector Framework Server provides features to process HTML files.
HTML processing with WKOOP. CFS can use an embedded browser (WKOOP) to process HTML in a similar way to the IDOL Web Connector. There are many reasons to use WKOOP over other methods of processing HTML:
To use WKOOP you must also install the IDOL Web Connector, because WKOOP is not provided with CFS.
|