CategorySimpleCategorize

Allows you to specify text, documents, or file content and return categories that are similar to the text, documents, or file content. This action also returns terms and weights for each category.

NOTE:

This is an administrative action that can be sent only from AdminClients (which are set in the [Server] section of the configuration file).

This action uses the SimpleCat algorithm for categorization. SimpleCat is optimized for categorization of small documents that contain only a few words. Other actions (such as CategorySuggestFromText) use the standard categorization algorithm. HPE generally recommends that you use standard categorization unless you are using any of the features that are specific to SimpleCat.

When you use CategorySimpleCategorize for a document, the results include a confidence value for each SimpleCat that describes how well a document fits into each category. The confidence values sum to 1 over all SimpleCats. HPE Category Component then assigns the document to the SimpleCat that has the highest confidence value. This process is different to standard categorization, in which a document might be assigned to no category, or to more than one.

Additionally, with SimpleCat, you can set a default category by using the SimpleCatDefaultCat parameter for the CategorySetDetails action. If a document is empty, or if it contains no words in common with any training document, HPE Category Component assigns it to the default SimpleCat.

You can adjust the results by using the SimpleCatParam action parameter for the CategorySetDetails action. You can adjust the importance of word pairs as features (compared to single words) by using the SimpleCatPairFactor configuration parameter.

You can specify cluster job names and exact cluster numbers when you run the CategorySimpleCategorize action, by using the SourceJobName and Cluster parameters. This option allows you to perform sentiment analysis on clusters.

You can specify secure documents for this action. Use the SecurityInfo parameter to specify the security string to use.

Example

http://12.3.4.56:9000/action=CategorySimpleCategorize&Parent=32349987602210557106&QueryText=Caring for passiflora incarnata

This action uses port 9000 to request categories that are similar to the text Caring for passiflora incarnata from the set of SimpleCat categories contained by the parent category 32349987602210557106. The categories are requested from HPE Category Component, which is located on a machine with the IP address 12.3.4.56.

Parameters

Parameter Description Required
Cluster The clusters to import to categories.  
DatabaseMatch Databases that contain documents to suggest on.  
DocID The document ID of the documents to suggest on. See Comments
DocRef The reference of the documents to suggest on. See Comments
Language The language of the query text.  
MaxTerms The maximum number of terms and weights to return.  
Parent The parent category of the set of SimpleCat categories to use in categorization. Yes
PrintFields The category fields to display in the results.  
QueryFile The file name to suggest on. See Comments
QueryText The text to suggest on. See Comments
SecurityInfo The security details for the user that sends the action.  
SourceJobName The name of the cluster source job to import to the category hierarchy.  
Threshold The relevance threshold of results categories.  
UserName The name of the user that sends the action.  

This action accepts the following standard ACI action parameters.

Parameter Description
ActionID A string to use to identify an ACI action.
EncryptResponse Encrypt the output.
FileName The file to write output to.
ForceTemplateRefresh Forces the server to load the template from disk.
Output Writes output to a file.
ResponseFormat The format of the action output.
Template The template to use for the action output.
TemplateParamCSVs A list of variables to use for the specified template.

Comments

You must specify one of the following parameters:


_HP_HTML5_bannerTitle.htm