Connector Framework Server

Connector Framework Server is an ACI server. For details of changes that affect all ACI servers, see ACI Server Framework.

24.1.0

There were no new features or resolved issues in this release.

23.4.0

New Features

  • The NIST RDS Filter task and NIST RDS tool have been updated to support the NIST RDSv3 hash sets.

Resolved Issues

There were no resolved issues in this release.

23.3.0

There were no new features or resolved issues in this release.

Notes

  • When you use the WkoopHtmlExtraction task or wkoop_html_processing Lua function, the embedded web browser (WKOOP) is no longer permitted to navigate away from the source page. If necessary you can allow navigation by setting the new configuration parameter AllowNavigations=TRUE, or by setting the named argument allow_navigations in the Lua function call.

23.2.0

New Features

  • The WkoopHtmlExtraction task can use the Mozilla readability library to clip pages. Clipping removes uninteresting parts of a page such as navigation bars and advertisements, to prevent irrelevant information being added to the IDOL index. Automatic clipping was available in previous versions of CFS but the readability library produces better results in some cases. To clip pages using the readability library, set ClippingMode=READABILITY. This feature is not available on FIPS-compliant platforms.
  • When extracting metadata from a page, the WkoopHtmlExtraction task can write the information into structured document fields. Earlier versions of CFS could not be configured to write this information into sub-fields.

Resolved Issues

  • CFS could terminate unexpectedly when ImportExtractExternal=TRUE and a file contained multiple URLs to external files.

Notes

  • As a result of the improvements to clipping, you must update any WkoopHtmlExtraction tasks that perform clipping. The Clipped parameter has been removed and replaced with a new parameter, ClippingMode. For more information, please refer to the Connector Framework Server Help.
  • As a result of the improvements to metadata extraction, you must update any WkoopHtmlExtraction tasks that extract metadata.

    • In earlier versions of CFS, the MetadataSelector and MetadataFieldName parameters accepted multiple values. With CFS 23.2 these parameters accept a single value and you should use the new parameter MetadataFieldSections to specify the names of sections that contain these parameters (one section for each field that you want to create in your IDOL documents).

      The configuration parameter MetadataSelectorExtractPlainText has been renamed to MetadataExtractPlainText.

      For example:

      CFS 12.13 CFS 23.2
      [WkoopHtmlExtractionTask]
      ...
      MetadataSelector0=h1
      MetadataFieldName0=HeadingOne
      MetadataSelectorExtractPlainText=TRUE
      [WkoopHtmlExtractionTask]
      ...
      MetadataFieldSections0=ExtractH1
      
      [ExtractH1]
      MetadataSelector=h1
      MetadataFieldName=HeadingOne
      MetadataExtractPlainText=TRUE
    • The configuration parameters ChildMetadataSelector, ChildMetadataSelectorExtractPlainText, ChildMetadataFieldName, and ChildMetadataAttribute have been removed. You can now extract metadata for child documents using the same parameters that you would use for the main document. Use the new parameter ChildDocumentMetadataFieldSections to specify the names of sections that contain settings for metadata extraction from child documents.

Deprecated Features

The following features are deprecated and might be removed in a future release.

Category Deprecated Feature Deprecated Since
WKOOPHtmlExtraction The SMARTPRINT clipping mode has been deprecated. 23.3.0