Allows you to specify text, documents, or file content and return categories that are similar to the text, documents, or file content. This action also returns terms and weights for each category.
This is an administrative action that can be sent only from AdminClients (which are set in the [Server] section of the configuration file).
This action uses the SimpleCat algorithm for categorization. SimpleCat is optimized for categorization of small documents that contain only a few words. Other actions (such as CategorySuggestFromText) use the standard categorization algorithm. HPE generally recommends that you use standard categorization unless you are using any of the features that are specific to SimpleCat.
When you use CategorySimpleCategorize for a document, the results include a confidence value for each SimpleCat that describes how well the document fits into that category. The confidence values sum to 1 over all SimpleCats. HPE IDOL Server then assigns the document to the SimpleCat that has the highest confidence value. This process is different from standard categorization, in which a document might be assigned to no category, or to more than one.
Additionally, with SimpleCat, you can set a default category by using the SimpleCatDefaultCat parameter for the CategorySetDetails action. If a document is empty, or if it contains no words in common with any training document, HPE IDOL Server assigns it to the default SimpleCat.
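The assignment rule described above can be illustrated with a minimal sketch. HPE IDOL Server performs this step itself; the function name, category names, and confidence values below are hypothetical and only show the rule (highest confidence wins, with the default SimpleCat used when no confidences are available).

```python
import math

def assign_simple_cat(confidences, default_category=None):
    """Return the category with the highest confidence.

    `confidences` maps category name -> confidence value. If it is empty
    (for example, the document shares no words with any training
    document), fall back to `default_category`.
    """
    if not confidences:
        return default_category
    return max(confidences, key=confidences.get)

# Hypothetical confidence values for illustration only.
confidences = {"Gardening": 0.7, "Cooking": 0.2, "Travel": 0.1}

# SimpleCat confidence values sum to 1 over all categories.
assert math.isclose(sum(confidences.values()), 1.0)

assigned = assign_simple_cat(confidences, default_category="Uncategorized")
fallback = assign_simple_cat({}, default_category="Uncategorized")
print(assigned, fallback)
```

Unlike standard categorization, this rule always yields exactly one category per document.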
You can adjust the results by using the SimpleCatParam action parameter for the CategorySetDetails action. You can adjust the importance of word pairs as features (compared to single words) by using the SimpleCatPairFactor configuration parameter.
You can specify cluster job names and exact cluster numbers when you run the CategorySimpleCategorize action, by using the SourceJobName and Cluster parameters. This option allows you to perform sentiment analysis on clusters.
You can specify secure documents for this action. Use the SecurityInfo parameter to specify the security string to use.
http://12.3.4.56:9000/action=CategorySimpleCategorize&Parent=32349987602210557106&QueryText=Caring for passiflora incarnata
This action uses port 9000 to request categories that are similar to the text Caring for passiflora incarnata from the set of SimpleCat categories contained by the parent category 32349987602210557106. The categories are requested from HPE IDOL Server, which is located on a machine with the IP address 12.3.4.56.
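When the action is sent from a script, the query text usually needs to be percent-encoded because it contains spaces. The following is a minimal sketch that builds the example request with the Python standard library; the helper name `build_simple_categorize_url` is hypothetical, and the host, port, parent, and text are the values from the example above.

```python
from urllib.parse import quote, urlencode

def build_simple_categorize_url(host, port, parent, query_text):
    """Build a CategorySimpleCategorize request URL with encoded values."""
    params = {
        "Parent": parent,
        "QueryText": query_text,
    }
    # quote_via=quote encodes spaces as %20 rather than '+'.
    query = urlencode(params, quote_via=quote)
    return f"http://{host}:{port}/action=CategorySimpleCategorize&{query}"

url = build_simple_categorize_url(
    "12.3.4.56", 9000, "32349987602210557106",
    "Caring for passiflora incarnata",
)
print(url)
```

Because this is an administrative action, the request is accepted only when it is sent from a machine listed in AdminClients.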
Parameter | Description | Required |
---|---|---|
Cluster | The clusters to import to categories. | |
DatabaseMatch | Databases that contain documents to suggest on. | |
DocID | The document ID of the documents to suggest on. | See Comments |
DocRef | The reference of the documents to suggest on. | See Comments |
Language | The language of the query text. | |
MaxTerms | The maximum number of terms and weights to return. | |
Parent | The parent category of the set of SimpleCat categories to use in categorization. | Yes |
PrintFields | The category fields to display in the results. | |
QueryFile | The file name to suggest on. | See Comments |
QueryText | The text to suggest on. | See Comments |
SecurityInfo | The security details for the user that sends the action. | |
SourceJobName | The name of the cluster source job to import to the category hierarchy. | |
Threshold | The relevance threshold of results categories. | |
UserName | The name of the user that sends the action. | |
This action accepts the following standard ACI action parameters.
Parameter | Description |
---|---|
ActionID | A string to use to identify an ACI action. |
EncryptResponse | Encrypt the output. |
FileName | The file to write output to. |
ForceTemplateRefresh | Forces the server to load the template from disk. |
Output | Writes output to a file. |
ResponseFormat | The format of the action output. |
Template | The template to use for the action output. |
TemplateParamCSVs | A list of variables to use for the specified template. |
You must specify one of the following parameters (marked See Comments in the table above): DocID, DocRef, QueryFile, or QueryText.
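A client can check these requirements before it sends the action. The sketch below assumes that the parameters marked See Comments in the table (DocID, DocRef, QueryFile, and QueryText) are the ones the requirement refers to, and that supplying at least one of them is sufficient. The function name `validate_params` is hypothetical and is not part of any IDOL API.

```python
# Assumption: these are the "See Comments" parameters from the table.
CONTENT_PARAMS = ("DocID", "DocRef", "QueryFile", "QueryText")

def validate_params(params):
    """Return a list of problems found in a parameter dictionary."""
    problems = []
    # Parent is the only parameter marked as required in the table.
    if not params.get("Parent"):
        problems.append("Parent is required")
    # Assumption: at least one content parameter must be present.
    if not any(params.get(name) for name in CONTENT_PARAMS):
        problems.append("specify one of: " + ", ".join(CONTENT_PARAMS))
    return problems

ok = validate_params({
    "Parent": "32349987602210557106",
    "QueryText": "Caring for passiflora incarnata",
})
missing = validate_params({"Parent": "32349987602210557106"})
print(ok, missing)
```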