To identify the language of speech
Create a new configuration or open an existing configuration to send to Media Server with the process
action. Alternatively, you can modify the Media Server configuration file (mediaserver.cfg
).
In the [Analysis]
section, add a new analysis task by setting the AnalysisEngineN
parameter. You can give the task any name, for example:
[Analysis] AnalysisEngine0=SpeechLanguageId
Create a new section to contain the settings for the task, and set the following parameters:
Type
|
The analysis engine to use. Set this parameter to LanguageID . |
Input
|
(Optional) The audio track to process. If you do not specify an input track, Media Server processes the first audio track produced by the ingest engine. |
LanguageIdServers
|
The host name and ACI port of an IDOL Speech Server. Separate the host name and port with a colon (for example, Tip: You can specify a default IDOL Speech Server to use for all language identification tasks by setting the |
LangList
|
(Optional) The list of languages to consider when running language identification. If you know which languages are likely to be present in the media, HPE recommends setting this parameter because restricting the possible languages can increase accuracy and improve performance. |
CumulativeMode
|
(Optional, default If you expect the audio to contain only one language or you want to identify the primary language that is spoken in the audio, set this parameter to If you set this parameter to |
SegmentSize
|
(Optional, default 15 ) The amount of audio to analyze as a single segment, in seconds. |
For example:
[SpeechLanguageId] Type=LanguageID LanguageIDServers=speechserver:15000 CumulativeMode=True SegmentSize=30
For more information about the parameters that you can use to configure this task, refer to the Media Server Reference.
Save and close the configuration file. If you modified the Media Server configuration file, you must restart Media Server for your changes to take effect.
|