SpeechThresh

The threshold between speech and non-speech (music or noise).

NOTE:

As of the 11.2 release, HPE IDOL Speech Server uses a new algorithm for audio preprocessing. The new algorithm uses deep neural network (DNN) technology, which provides better performance and requires less tailoring to specific audio types.

The new implementation ignores the speech and silence threshold parameters, including SpeechThresh. It also distinguishes between music and noise, rather than treating them as a single non-speech category. All tasks use the new algorithm, but the old algorithm is retained for backward compatibility and can be used in exactly the same way as before.

Action: AddTask, CheckResources
Task: SpeechSilClassification, LangIdSegWav, LangIdCumWav, LangIdBndWav, LangIdSegStream, LangIdCumStream, LangIdBndStream, StreamToTextMusicFilter, StreamToTextMusicFilterPunct
Type: Integer
Default: -25
Range: -50 to 10
Example: SpeechThresh=0
See Also: SpeechThreshOffset (configuration parameter)
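
The following Python sketch illustrates how a client might check a SpeechThresh value against the documented type and range before adding it to an AddTask request. It is a minimal illustration, not part of the product: the helper names (validate_speech_thresh, build_add_task_query) are hypothetical, and the query layout (action=AddTask with the task name passed as Type) is an assumption that you should confirm against the AddTask action reference. The value only has an effect when the old preprocessing algorithm is in use, because the new algorithm ignores it.

```python
from urllib.parse import urlencode

# Limits taken from the Type, Default, and Range entries above.
SPEECH_THRESH_MIN = -50
SPEECH_THRESH_MAX = 10
SPEECH_THRESH_DEFAULT = -25


def validate_speech_thresh(value):
    """Return value unchanged if it is an integer in the documented range."""
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(value, bool) or not isinstance(value, int):
        raise TypeError("SpeechThresh must be an integer")
    if not SPEECH_THRESH_MIN <= value <= SPEECH_THRESH_MAX:
        raise ValueError(
            f"SpeechThresh must be between {SPEECH_THRESH_MIN} "
            f"and {SPEECH_THRESH_MAX}, got {value}"
        )
    return value


def build_add_task_query(task_type, speech_thresh=SPEECH_THRESH_DEFAULT):
    """Build a query string for an AddTask request (hypothetical helper).

    Assumption: the task name is sent as the Type parameter. Other
    parameters that the task requires are omitted from this sketch.
    """
    params = {
        "action": "AddTask",
        "Type": task_type,
        "SpeechThresh": validate_speech_thresh(speech_thresh),
    }
    return urlencode(params)


if __name__ == "__main__":
    print(build_add_task_query("SpeechSilClassification", 0))
```

Validating on the client side means that an out-of-range value such as 11 raises an error before the request is sent.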
