Glossary

    A
  • Autonomy Content Infrastructure. ACI is the method that Autonomy components use to communicate with each other.
  • A server component that runs on the Autonomy Content Infrastructure (ACI).
  • A request sent to an ACI Server.
  • A process that searches for information about a specific topic, according to a training query or text.
  • Adaptive Probabilistic Concept Modeling. A statistical method, used by IDOL server to calculate query result weights and ranks.
  • C
  • A set of criteria that define a particular topic, which you can use to categorize documents that contain content relevant to the topic.
  • An automatically identified set of related documents.
  • A group of users with related interests or expertise.
  • Autonomy Connectors extract content from source repositories, and send them to CFS for processing and indexing.
  • CFS processes data from connectors and sends it to IDOL for indexing.
  • D
  • An IDOL server database is a data pool that stores indexed information. You can restrict queries to a particular database.
  • DAH distributes actions to multiple versions of IDOL server or a component. It allows you to use failover, load balancing, or distributed content.
  • DIH distributes index actions to multiple versions of IDOL server or an indexing component. It allows you to copy or distribute your content.
  • E
  • The process of extracting entities (patterns of text) from documents.
  • A word, phrase, or block of information that the Eduction component can match and extract from documents.
  • In Eduction, an entity is a word, phrase, or block of information. An entity can be a specific text string, such as a name, or it can be a pattern of text such as an address or phone number. You define the pattern in a grammar, which Eduction uses to find the entities in documents.
  • The process of extracting text, metadata, and subfiles from documents. IDOL KeyView performs this extraction process in IDOL.
  • F
  • A group of settings that instruct a connector how to retrieve data from a repository. Connectors can run fetch tasks automatically, or in response to an action.
  • Fields define different parts of content in IDOL documents, such as the title, content, and metadata information.
  • G
  • In Eduction, a grammar is a pattern that defines an entity.
  • I
  • An encoded value that identifies the source of a document in IDOL server. Connectors and CFS add identifiers to every document that they create for index into IDOL Server. They store this value in the AUTN_IDENTIFIER field.
  • Intelligent Data Operating Layer. The IDOL data platform allows you to convert your source content to an index of useful data that you can search and analyze.
  • The core server in the IDOL data platform. IDOL server performs content processing, indexing, search and retrieval, and many of IDOL's classification, user, and security functions.
  • A format of documents that IDOL server uses for indexing.
  • Importing is the process where CFS, using KeyView, extracts metadata, content, and sub-files from items retrieved by a connector. CFS adds the information to documents so that it is indexed into IDOL Server. Importing allows IDOL server to use the information in a repository, without needing to process the information in its native format.
  • A processing task run by CFS on new documents before they are indexed into to IDOL Server. Import tasks can run before or after (pre- or post-) KeyView filtering.
  • The IDOL server data index contains document content and field information for analysis and retrieval.
  • A processing task run by CFS on documents. Update-index tasks run when a document's metadata (but not its content) is updated. Delete-index tasks run when a document is deleted from a repository.
  • The process of adding information to the IDOL index. This process includes all linguistic processing, storing information for optimized retrieval, and storing document content.
  • Ingestion converts information that exists in a repository into documents that can be indexed into IDOL Server. Ingestion starts when a connector finds new documents in a repository, or documents that have been updated or deleted, and sends this information to CFS. Ingestion includes the import process, and processing tasks that can modify and enrich the information in a document.
  • K
  • The IDOL component that extracts data, including text, metadata, and subfiles from over 1,000 different file types.
  • L
  • An embedded scripting language that you can use to write custom scripts to expand certain IDOL functionality.
  • P
  • Information about a user, based on the concepts in documents that they access. IDOL server automatically creates profiles for users, according to their interests.
  • Q
  • A search string that you can use to retrieve information from IDOL server that matches your syntax. Queries include simple conceptual queries, and complicated strings with Boolean, proximity, and field restriction expressions.
  • A server that manipulates incoming searches to modify the query, modify the results set, or return promotions.
  • R
  • A string that is used to identify a document. This might be a title or a URL, and allows IDOL to identify documents for retrieval, indexing, and deduplication.
  • S
  • Security includes anything that makes sure that only authorized users can access or perform actions on data. It includes making sure that only permitted users can view and retrieve documents, user authentication, and secure communications.
  • In clustering with snapshots, a seed is a potential cluster. It contains a document, and suggested conceptually similar documents from the IDOL server index.
  • A form of Eduction that identifies positive and negative sentiment in text.
  • The process of finding a common root for related words. IDOL uses this common root to search, so it finds the concept rather than the particular term.
  • A list of stop words, which IDOL uses to discard these terms at index and query time, which can improve performance.
  • A word in a language that occurs frequently and does not add much meaning to text.
  • A file that has been extracted from a container (such as a .ZIP archive).
  • A few sentences or paragraphs that describe what a document is about. IDOL can automatically create summaries from document content.
  • T
  • A hierarchical structure of categories, or other information, which gives you an overview of your data.
  • The basic entity that IDOL indexes; for example, a word after stemming.
  • Terms and Weights. These are used in categorization to define the most important terms that define a category topic.
  • Text, documents, and query syntax used to define the topic that an agent or category must match.
  • U
  • Wildcard searches (and some advanced search techniques) use the index of unstemmed terms. This index contains all the exact terms that occur in documents.
  • V
  • An IDOL component that converts files in a repository to HTML formats for viewing in a Web browser.