Analyze Media

Your connectors might retrieve images, audio, and video from data repositories. KeyView can extract metadata from the media but cannot process its content. To enrich documents that represent media files, you can send the media files to an IDOL Media Server. Media Server processes the files and extracts useful information.

For example, you can run optical character recognition to extract text from scanned documents, or face recognition to recognize faces in photographs. You might want to tag all images and documents that contain your company logo by running object recognition. The information extracted by Media Server is added to the document's metadata or content. After this information has been added to the document, it can be used by other IDOL operations.

Depending on the type of analysis you want to run, you might need to train Media Server. For example, to use face recognition or object recognition, you must train Media Server to recognize specific faces or objects.

How to configure media analysis depends on how you have set up ingestion:

  • IDOL NiFi Ingest. In NiFi, set up a KeyViewRouteOnFileType processor to identify documents that represent rich media files. You can route these documents to a MediaAnalysis processor, which sends the associated media files to your IDOL Media Server.
  • Connector Framework Server. With CFS, you can write a Lua script that reads the document metadata to find the file type (KeyView adds metadata fields named DocumentClass and DocumentType that contain this information). For documents that represent media files, the script can then send a request to Media Server by calling the analyze_media_in_document Lua function.
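
Both approaches make the same decision: read the file type from the document metadata, and send only rich media documents for analysis. The following is a minimal, language-neutral sketch of that decision, written in Python for illustration only. It is not CFS or NiFi code; in CFS the equivalent logic would be written in Lua. The class values in MEDIA_CLASSES and the step names returned by route are hypothetical placeholders; look up the real DocumentClass values in your KeyView documentation.

```python
# Placeholder values (assumptions): replace them with the DocumentClass
# values that KeyView reports for image, audio, and video files.
MEDIA_CLASSES = {"image-class", "audio-class", "video-class"}


def is_rich_media(metadata, media_classes=MEDIA_CLASSES):
    """Return True if the DocumentClass field names a rich media class."""
    return metadata.get("DocumentClass") in media_classes


def route(metadata):
    """Return the (hypothetical) name of the next step for a document."""
    return "media_analysis" if is_rich_media(metadata) else "skip"


if __name__ == "__main__":
    print(route({"DocumentClass": "image-class"}))
    print(route({"DocumentClass": "some-other-class"}))
```

Documents that have no DocumentClass field are skipped, so that only documents KeyView has positively identified as media are sent to Media Server.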

You must also configure your Media Server by creating a session configuration that performs the analysis tasks you want to run. CFS and the NiFi Ingest MediaAnalysis processor retrieve analysis results from the ACI response, so the Media Server session configuration must output data using the ACI (response) output engine.

Micro Focus recommends that you test the Media Server session configuration by manually sending an action to Media Server.
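
As an illustration of such a manual test, the following Python sketch builds the URL for a Media Server process action, which you could then paste into a web browser. The host, port, file path, and configuration name are hypothetical, and the Source and ConfigName parameter names are assumptions to check against the Media Server reference for your version.

```python
from urllib.parse import urlencode


def build_process_action(host, port, source, config_name):
    """Build the URL for a process action (parameter names are assumptions)."""
    # urlencode percent-encodes characters such as "/" in the file path.
    query = urlencode({"Source": source, "ConfigName": config_name})
    return f"http://{host}:{port}/action=Process&{query}"


if __name__ == "__main__":
    # Hypothetical host, port, source file, and session configuration name.
    print(build_process_action("localhost", 14000, "./samples/scan.png", "ocr_test"))
```

If the response to the action contains the analysis results that you expect, the same session configuration should return usable results to CFS or to the MediaAnalysis processor.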