The [FieldProcessing] section in the IDOL Content component configuration file allows you to identify particular fields in documents. During indexing, you can then apply processing to those fields, or to the documents that contain them, depending on the field value.
In this way you can apply multiple processes to documents without needing to set up a configuration section for each process combination.
When identifying fields, use the following formats:
/FieldName to match root-level fields.
*/FieldName to match all fields except root-level.
/Path/FieldName to match fields that the specified path points to.
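The three formats can be illustrated with a short Python sketch. This is not how Content implements matching. The function name matches, the representation of a field as a slash-separated path string, and the case-sensitive comparison are all assumptions made for illustration.

```python
def matches(pattern: str, field_path: str) -> bool:
    """Return True if a hypothetical field path fits one of the three formats.

    Assumption: a field is written as a slash-separated path such as
    "/DOCUMENT/DRETITLE", and names are compared case-sensitively.
    """
    parts = [p for p in field_path.split("/") if p]
    if pattern.startswith("*/"):
        # */FieldName: any field with this name that is not at root level.
        return len(parts) >= 2 and parts[-1] == pattern[2:]
    # /FieldName and /Path/FieldName: the whole path must match exactly.
    return [p for p in pattern.split("/") if p] == parts


print(matches("/DRETITLE", "/DRETITLE"))            # root-level field
print(matches("*/DRETITLE", "/DOCUMENT/DRETITLE"))  # field below root level
print(matches("*/DRETITLE", "/DRETITLE"))           # root level is excluded
```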
Field names must not contain spaces, accents, or multibyte characters, and they must not start with a number. For IDX documents, Content converts these text elements to underscores (_) when it indexes the fields. You must also change any queries that reference these fields so that they use the modified field names.
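To work out which query field names might need updating, a rough sketch such as the following can help. It assumes that each space or non-ASCII character becomes a single underscore. The exact conversion that Content performs, including how it treats a leading digit, is not specified here, so the sketch only flags that case. Both function names are hypothetical.

```python
def sanitize_field_name(name: str) -> str:
    """Approximate the underscore conversion described above.

    Assumption: one underscore replaces each space or non-ASCII character.
    A leading digit is left unchanged because the rule for it is not known.
    """
    return "".join("_" if ch == " " or ord(ch) > 127 else ch for ch in name)


def starts_with_digit(name: str) -> bool:
    """Flag names that break the 'must not start with a number' rule."""
    return name[:1].isdigit()


print(sanitize_field_name("my field"))   # my_field
print(sanitize_field_name("caf\u00e9"))  # caf_
print(starts_with_digit("2ndAuthor"))    # True
```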
Open the IDOL Content component configuration file in a text editor.
In the [FieldProcessing] section, list the processes to apply to fields. For example:
[FieldProcessing]
0=MyFirstProcess
1=IndexFields
2=MyCombinedProcess
3=IndexAndWeightHigher
Create a section for each process that you listed. In each section, use the Property parameter to declare a property for the process (you define the property later by setting one or more applicable configuration parameters), and use the PropertyFieldCSVs parameter to identify the fields to associate with the process.
You can use the PropertyMatch parameter to identify a specific value that fields must have to be processed. (This is useful if you set up a process that identifies security or language fields.)
The properties that you create must not have the same name as the processes.
For example:
[MyFirstProcess]
Property=MyFirstProperty
PropertyFieldCSVs=*/MyField,*/MySecondField
PropertyMatch=*myString*

[IndexFields]
Property=MySecondProperty
PropertyFieldCSVs=*/DRECONTENT,*/DRETITLE

[MyCombinedProcess]
Property=MyCombinedProperty
PropertyFieldCSVs=*/MyDateField,*/MyIndexField

[IndexAndWeightHigher]
Property=IndexHigherWeight
PropertyFieldCSVs=*/SUMMARIES
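The PropertyMatch value in the example above, *myString*, can be read as a wildcard pattern on the field value. The following Python sketch shows that reading. It assumes that each asterisk matches any run of characters and that the comparison is case-sensitive. Neither assumption is confirmed here, and value_matches is a hypothetical helper, not part of Content.

```python
import re


def value_matches(pattern: str, value: str) -> bool:
    """Test a field value against a pattern in which * is a wildcard.

    Assumption: * matches any run of characters (including none), and
    every other character must match literally and case-sensitively.
    """
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.fullmatch(regex, value, flags=re.DOTALL) is not None


print(value_matches("*myString*", "prefix myString suffix"))  # True
print(value_matches("*myString*", "something else"))          # False
```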
Create a section for each of the properties and specify appropriate configuration parameters for each. These configuration parameters define the processes that are applied to all the fields (or all documents that contain the fields) that you previously associated with the processes.
For example:
[MyFirstProperty]
HiddenType=True

[MySecondProperty]
Index=True

[MyCombinedProperty]
DateType=True
Index=True

[IndexHigherWeight]
Index=True
Weight=2
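The steps above lend themselves to a quick consistency check before you restart Content. The following Python sketch uses the standard configparser module to confirm that every listed process has a section, that each process declares a property, that the property has its own section, and that no property shares a name with a process. It is an external helper written for illustration, not an IDOL tool. The function name check_field_processing is hypothetical, and the sketch assumes the configuration is simple enough for configparser to read.

```python
import configparser

SAMPLE = """\
[FieldProcessing]
0=MyFirstProcess
1=IndexFields

[MyFirstProcess]
Property=MyFirstProperty
PropertyFieldCSVs=*/MyField,*/MySecondField
PropertyMatch=*myString*

[IndexFields]
Property=MySecondProperty
PropertyFieldCSVs=*/DRECONTENT,*/DRETITLE

[MyFirstProperty]
HiddenType=True

[MySecondProperty]
Index=True
"""


def check_field_processing(text: str) -> list:
    """Return a list of problems found in a field-processing configuration."""
    cp = configparser.ConfigParser(comment_prefixes=("//",), interpolation=None)
    cp.optionxform = str  # keep parameter names case-sensitive
    cp.read_string(text)
    processes = list(cp["FieldProcessing"].values())
    problems = []
    for process in processes:
        if not cp.has_section(process):
            problems.append(f"no section for process [{process}]")
            continue
        prop = cp[process].get("Property")
        if prop is None:
            problems.append(f"[{process}] does not declare a Property")
        elif prop in processes:
            problems.append(f"property {prop} has the same name as a process")
        elif not cp.has_section(prop):
            problems.append(f"no section for property [{prop}]")
    return problems


print(check_field_processing(SAMPLE))  # an empty list means no problems found
```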
Example:
[FieldProcessing]
0=IndexFields
1=IndexAndWeightHigher
2=SectionBreakFields
3=DateFields
4=DatabaseFields
5=SetReferenceFields

[IndexFields]
// Controls which fields are indexed
Property=Index
PropertyFieldCSVs=*/DRECONTENT,*/DRETITLE

[IndexAndWeightHigher]
// Fields to index with a weight
Property=IndexWeight
PropertyFieldCSVs=*/SUMMARIES

[SectionBreakFields]
// Field containing document section number
Property=Section
PropertyFieldCSVs=*/DRESECTION

[DateFields]
// Fields containing the document date
Property=Date
PropertyFieldCSVs=*/DREDATE,*/harvest_time

[DatabaseFields]
// CSV of field names that define the document database
Property=Database
PropertyFieldCSVs=*/DREDBNAME

[SetReferenceFields]
// CSV of fields that define the document URL
Property=Reference
PropertyFieldCSVs=*/DREREFERENCE,*/DRETITLE

//---------------------------Properties----------------------//
[Index]
Index=True

[IndexWeight]
Index=True
Weight=2

[Section]
SectionBreakType=True

[Date]
DateType=True

[Database]
DatabaseType=True

[Reference]
ReferenceType=True
TrimSpaces=True
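In the example above, */DRETITLE appears in both [IndexFields] and [SetReferenceFields], so more than one process applies to the same field. The following Python sketch makes that visible by collecting, for each field pattern, the parameters of every property associated with it. It works on an excerpt of the example and uses the standard configparser module. The function name settings_by_field is hypothetical, and the simple dictionary merge is an illustration only, not a description of how Content combines processes internally.

```python
import configparser

EXCERPT = """\
[FieldProcessing]
0=IndexFields
1=IndexAndWeightHigher
2=SetReferenceFields

[IndexFields]
// Controls which fields are indexed
Property=Index
PropertyFieldCSVs=*/DRECONTENT,*/DRETITLE

[IndexAndWeightHigher]
// Fields to index with a weight
Property=IndexWeight
PropertyFieldCSVs=*/SUMMARIES

[SetReferenceFields]
// CSV of fields that define the document URL
Property=Reference
PropertyFieldCSVs=*/DREREFERENCE,*/DRETITLE

[Index]
Index=True

[IndexWeight]
Index=True
Weight=2

[Reference]
ReferenceType=True
TrimSpaces=True
"""


def settings_by_field(text: str) -> dict:
    """Map each field pattern to the parameters of its associated properties."""
    cp = configparser.ConfigParser(comment_prefixes=("//",), interpolation=None)
    cp.optionxform = str  # keep parameter names case-sensitive
    cp.read_string(text)
    merged = {}
    for process in cp["FieldProcessing"].values():
        prop = cp[process]["Property"]
        for field in cp[process]["PropertyFieldCSVs"].split(","):
            # Collect the property's parameters under this field pattern.
            merged.setdefault(field.strip(), {}).update(cp[prop])
    return merged


for field, params in settings_by_field(EXCERPT).items():
    print(field, params)
```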