To run speech-to-text
Create a new configuration to send to HPE Media Server with the process
action, or open an existing configuration that you want to modify.
In the [Analysis] section, add a new analysis task by setting the AnalysisEngineN parameter. You can give the task any name, for example:
[Analysis]
AnalysisEngine0=TranscribeSpeech
Create a new section to contain the settings for the task and set the following parameters:
Type | The analysis engine to use. Set this parameter to SpeechToText.
Input | (Optional) The audio track to analyze. If you do not specify an input track, HPE Media Server processes the first track of the correct type produced by the ingest engine.
SpeechToTextServers | The host name and ACI port of an IDOL Speech Server. Separate the host name and port with a colon. TIP: You can specify a default IDOL Speech Server to use for all speech-to-text tasks.
Language | The language pack to use. For a list of available language packs, refer to the IDOL Speech Server Administration Guide.
Mode | The mode for speech-to-text transcription. The modes that HPE Media Server supports are described in the IDOL Speech Server Reference.
ModeValue | The mode value for speech-to-text transcription. For more information about this parameter, refer to the IDOL Speech Server Reference.
FilterMusic | (Optional) Specifies whether to ignore speech-to-text results for audio segments that Speech Server identifies as music or noise. To filter these results from the output, set this parameter to true.
SampleFrequency | (Optional) The sample frequency of the audio to send to the IDOL Speech Server for analysis, in samples per second (Hz). IDOL Speech Server language packs depend on the audio sample rate, and accept audio at either 8000 Hz or 16000 Hz.
For example:
[TranscribeSpeech]
Type=SpeechToText
SpeechToTextServers=speechserver:13000
Language=ENUK
Mode=relative
ModeValue=0.8
FilterMusic=TRUE
For more information about the parameters that you can use to configure speech-to-text, refer to the HPE Media Server Reference.
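Before you save the file, it can help to check that every task named in [Analysis] has a matching section and that the speech-to-text settings look plausible. The following is a minimal sketch of such a check using Python's standard configparser module. It is not part of HPE Media Server. The function name check_speech_to_text_config is hypothetical, and the choice to treat Language, Mode, and ModeValue as required is an assumption based only on the table above not marking them as optional.

```python
import configparser

# Example configuration text, taken from the steps in this procedure.
EXAMPLE_CONFIG = """
[Analysis]
AnalysisEngine0=TranscribeSpeech

[TranscribeSpeech]
Type=SpeechToText
SpeechToTextServers=speechserver:13000
Language=ENUK
Mode=relative
ModeValue=0.8
FilterMusic=TRUE
"""

# Assumption: parameters not marked "(Optional)" are treated as required.
# SpeechToTextServers is left out because a default server can be set elsewhere.
REQUIRED = ("Language", "Mode", "ModeValue")


def check_speech_to_text_config(text):
    """Return a list of problems found in the configuration text (empty if none)."""
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(text)
    if not parser.has_section("Analysis"):
        return ["missing [Analysis] section"]

    problems = []
    # configparser lowercases option names, so AnalysisEngine0 becomes analysisengine0.
    for key, task in parser.items("Analysis"):
        if not key.startswith("analysisengine"):
            continue
        if not parser.has_section(task):
            problems.append(f"no [{task}] section for {key}")
            continue
        section = parser[task]
        if section.get("Type", "").strip().lower() != "speechtotext":
            continue  # a different analysis engine; not checked by this sketch
        for name in REQUIRED:
            if name not in section:
                problems.append(f"[{task}] is missing {name}")
        server = section.get("SpeechToTextServers")
        if server is not None and ":" not in server:
            problems.append(f"[{task}] SpeechToTextServers should be host:port")
        freq = section.get("SampleFrequency")
        if freq is not None and freq.strip() not in ("8000", "16000"):
            problems.append(f"[{task}] SampleFrequency should be 8000 or 16000")
    return problems


if __name__ == "__main__":
    for problem in check_speech_to_text_config(EXAMPLE_CONFIG):
        print(problem)
```

The check is deliberately shallow: it does not contact IDOL Speech Server or confirm that the language pack exists, so a configuration that passes can still fail at run time.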
Save and close the configuration file. HPE recommends that you save your configuration files in the location specified by the ConfigDirectory parameter.
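After you save the configuration, you send it to HPE Media Server with the process action. The following sketch only builds the request URL; it does not send anything. The host, the port 14000, the file path, the configuration name, and the parameter names source and configName are all assumptions for illustration, so check them against the HPE Media Server Reference for your installation. The helper build_process_url is hypothetical.

```python
from urllib.parse import urlencode


def build_process_url(host, port, source, config_name):
    """Build a URL for the process action (parameter names are assumed, not confirmed)."""
    # urlencode percent-encodes characters such as "/" in the source path.
    query = urlencode({"source": source, "configName": config_name})
    return f"http://{host}:{port}/action=process&{query}"


if __name__ == "__main__":
    # Hypothetical values: a local server, a sample file, and a saved configuration.
    print(build_process_url("localhost", 14000, "./video.mp4", "speech"))
```

Building the URL separately from sending it makes it easy to inspect the request before you run it against a server.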