CategorySimpleCategorize

Allows you to specify text, documents, or file content and return categories that are similar to the text, documents, or file content. This action also returns terms and weights for each category.

NOTE: This is an administrative action that can be sent only by users that belong to an authorization role that allows the Admin standard role, or which enables the action explicitly. See Authorization Roles Configuration Parameters.

You must set Parent to the name of the parent category that contains your simple categories, and you must specify the document or text to categorize by setting one of DocID, DocRef, QueryText, or QueryFile.

This action uses the SimpleCat algorithm for categorization. SimpleCat is optimized for categorization of small documents that contain only a few words. Other actions (such as CategorySuggestFromText) use the standard categorization algorithm. Micro Focus generally recommends that you use standard categorization unless you are using any of the features that are specific to SimpleCat.

When you use CategorySimpleCategorize for a document, the results include a confidence value for each SimpleCat that describes how well a document fits into each category. The confidence values sum to 1 over all SimpleCats. IDOL Server then assigns the document to the SimpleCat that has the highest confidence value. This process is different to standard categorization, in which a document might be assigned to no category, or to more than one.

Additionally, with SimpleCat, you can set a default category by using the SimpleCatDefaultCat parameter for the CategorySetDetails action. If a document is empty, or if it contains no words in common with any training document, IDOL Server assigns it to the default SimpleCat.

You can adjust the results by using the SimpleCatParam action parameter for the CategorySetDetails action. You can adjust the importance of word pairs as features (compared to single words) by using the SimpleCatPairFactor configuration parameter.

You can specify cluster job names and exact cluster numbers when you run the CategorySimpleCategorize action, by using the SourceJobName and Cluster parameters. This option allows you to perform sentiment analysis on clusters.

You can specify secure documents for this action. Use the SecurityInfo parameter to specify the security string to use.

Example

http://12.3.4.56:9000/action=CategorySimpleCategorize&Parent=32349987602210557106&QueryText=Caring for passiflora incarnata

This action requests categories that are similar to the text Caring for passiflora incarnata from the set of SimpleCat categories contained by the parent category 32349987602210557106.

Required Parameters

The following action parameters are required.

Parameter Description
Parent The parent category of the set of SimpleCat categories to use in categorization.

You must set one of the following parameters.

Parameter Description
DocID The document ID of the document to suggest on.
DocRef The reference of the document to suggest on.
QueryFile The file name to suggest on.
QueryText The text to suggest on.

Optional Parameters

This action accepts the following optional parameters.

Parameter Description
Cluster The clusters to import to categories.
DatabaseMatch Databases that contain documents to suggest on.
Language The language of the query text.
MaxTerms The maximum number of terms and weights to return.
PrintFields The category fields to display in the results.
SecurityInfo The security details for the user that sends the action.
SourceJobName The name of the cluster source job to import to the category hierarchy.
Threshold The relevance threshold of results categories.
UserName The name of the user that sends the action.

This action accepts the following standard ACI action parameters.

Parameter Description
ActionID A string to use to identify an ACI action.
EncryptResponse Encrypt the output.
FileName The file to write output to.
ForceTemplateRefresh Forces the server to load the template from disk.
Output Writes output to a file.
ResponseFormat The format of the action output.
Template The template to use for the action output.
TemplateParamCSVs A list of variables to use for the specified template.