Ingest XML
Many systems export information in XML format, and CFS has features to help you convert XML into Knowledge Discovery documents.
NOTE: The XML must be encoded in UTF-8.
You can configure CFS to transform XML files with an XSL transformation before they are processed. This step is optional, but it can be useful when your XML files do not resemble Knowledge Discovery documents, or when you are processing XML from many sources and the files have different schemas. You can configure any number of transformations; CFS runs the first transformation whose specified schema matches the ingested XML. You can also configure a default transformation that CFS runs when an XML file does not match any of your schemas.
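The selection rule described above (first match wins, with an optional default) can be sketched in a few lines of Python. This is only an illustration of the ordering logic, not CFS code: the function names, the stylesheet file names, and the rule list are hypothetical, and the root-element check is a simplified stand-in for real schema matching.

```python
import xml.etree.ElementTree as ET
from typing import Callable, Optional, Sequence, Tuple

# Hypothetical stand-in for schema matching: a rule "matches" when the root
# element has the expected tag. CFS matches against a configured schema.
Matcher = Callable[[ET.Element], bool]


def root_is(tag: str) -> Matcher:
    """Build a matcher that accepts XML whose root element is `tag`."""
    return lambda root: root.tag == tag


def select_transformation(
    xml_text: str,
    rules: Sequence[Tuple[Matcher, str]],
    default: Optional[str] = None,
) -> Optional[str]:
    """Return the stylesheet of the first matching rule, else the default."""
    root = ET.fromstring(xml_text)
    for matches, stylesheet in rules:
        if matches(root):
            return stylesheet  # first match wins; later rules are not tried
    return default  # None means no transformation applies


# Illustrative rules, in priority order (file names are made up).
rules = [
    (root_is("invoice"), "invoice.xsl"),
    (root_is("order"), "order.xsl"),
    (root_is("invoice"), "never_reached.xsl"),
]

print(select_transformation("<invoice/>", rules, "default.xsl"))
print(select_transformation("<memo/>", rules, "default.xsl"))
```

Because the rules are tried in order, the third rule in this list can never be selected. The same consideration applies when ordering real transformations: place the most specific schema first.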
After an XML file has been transformed, or when transformation is not configured, CFS attempts to convert the XML into Knowledge Discovery documents. The XML is parsed according to the rules that you configure in the [XmlParsing] section of the CFS configuration file. If the conversion is successful, the resulting metadata-only documents are added to the ingest queue (for more information about the ingestion process, see The Ingestion Process). If the conversion does not produce any Knowledge Discovery documents but the XML was transformed after matching a schema, CFS does not consider this a failure and does not index any documents. In all other cases, for example if the XML is invalid, the XML file is added to the import queue so that it is processed by File Content Extraction along with other file types.
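The three possible outcomes of the conversion step can be summarized as a small decision function. This is a minimal sketch of the rules as stated in this section, assuming a hypothetical `route` function and `Outcome` type; it does not reflect how CFS is implemented internally.

```python
from enum import Enum
from typing import Sequence


class Outcome(Enum):
    INGEST_QUEUE = "documents added to the ingest queue"
    NO_DOCUMENTS = "not a failure; no documents are indexed"
    IMPORT_QUEUE = "file added to the import queue"


def route(documents: Sequence[object], matched_schema: bool) -> Outcome:
    """Decide what happens to an XML file after the conversion attempt.

    documents      -- documents produced by the conversion (may be empty)
    matched_schema -- True if the XML was transformed after matching a schema
    """
    if documents:
        # Conversion succeeded: the metadata-only documents are ingested.
        return Outcome.INGEST_QUEUE
    if matched_schema:
        # Nothing produced, but a schema-matched transformation ran.
        return Outcome.NO_DOCUMENTS
    # Everything else, for example invalid XML, goes to the import queue.
    return Outcome.IMPORT_QUEUE


print(route(["doc1"], matched_schema=False).name)
print(route([], matched_schema=True).name)
print(route([], matched_schema=False).name)
```

Note that the second branch depends on a schema match. Based on the wording above, an empty result after only the default transformation is treated like any other unsuccessful conversion.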