ClippingMode

Clipping removes uninteresting parts of a page such as navigation bars and advertisements, to prevent irrelevant information being added to the IDOL index. Set this parameter to one of the following options:

  • NONE - Do not clip pages.
  • CSSCLIPPING - Clip pages using CSS selectors. To specify the parts of pages to keep and remove, set one or both of the parameters ClipPageUsingCssSelect and ClipPageUsingCssUnselect.
  • READABILITY - Clip pages using the Mozilla readability library. You can configure the behavior of this library by setting options in a JSON file specified by ReadabilityOptionsFile.

    NOTE: This option is not available on FIPS-compliant platforms.

  • SMARTPRINT - Clip pages using the SmartPrint algorithm.

    DEPRECATED: The SMARTPRINT option is deprecated in Connector Framework Server 23.3 and later. It will be removed in a future major release.

Type: String
Default: NONE
Required: No
Configuration Section: Any section that you have defined for WkoopHtmlExtraction settings
Example: ClippingMode=CSSCLIPPING
See Also:

ClipPageUsingCssSelect

ClipPageUsingCssUnselect