You can customize how documents are processed by import tasks, by adding the following fields to your documents.
The Lua script that adds the document fields must run before the import tasks.
To use an IDOL Speech Server to extract the speech from a document that represents an audio or video file, you must add the field AUTN_NEEDS_TRANSCRIPTION
to the document. The IdolSpeech
task only runs on documents that have this field. The field can have any value. For more information about the IdolSpeech
task, see Analyze Speech.
To bypass the transcoding step of an IdolSpeech
task, add the field AUTN_FORMAT_CORRECT_FOR_TRANSCRIPTION
. The field can have any value. Documents that have this field are not sent to a Transcode Server. For more information about the IdolSpeech
task, see Analyze Speech.
To bypass the language identification step of an IdolSpeech
task add the field AUTN_AUDIO_LANGUAGE
. The value of the field must be the name of the IDOL Speech Server language pack to use for extracting speech. Documents that have this field are not sent to the IDOL Speech Server for language identification. For more information about the IdolSpeech
task, see Analyze Speech.
To perform analysis on media files using the MediaServerAnalysis
task, you must add this field to every document that you want to analyze. The field can have any value.
|