LangIdSegWav

The LangIdSegWav task reads in data from an audio file, converts it into language identification features, and then processes the data in fixed-sized chunks. It returns the language identification results for each chunk.

Parameters

Parameter Description Required
Type The task type. Set to LangIdSegWav. Yes
Beam The beam width of the search process.  
ClassList A list of language classifiers to use. Yes
ClassPath The path to the directory containing the language classifiers. Yes
DnnFile The Deep Neural Network acoustic modeling file to use.  
File The audio file to process. Yes
Lang The name of the language pack to use. Yes
LangList A subset of languages to use from the classifier list file.  
NBest The maximum number of language candidates to include in the output file.  
Out The file to write language identification results to. Yes
SegSize The maximum results segment size.  
SugdInputChannels The channel layout of the input media file.  
SugdInputFrequency The sampling rate of the input media file.  

Example

http://localhost:13000/action=AddTask&Type=LangIdSegWav&File=C:\Data\Speech.wav&ClassList=ListManager/OptClassSet&ClassPath=C:\LangID\&Out=SpeechLang7.ctm

This action uses port 13000 to instruct IDOL Speech Server, which is located on the local machine, to identify the language in the Speech.wav file using the language classifiers specified in the OptClassSet list. The action instructs IDOL Speech Server to write the identification results to the SpeechLang7.ctm file.


_HP_HTML5_bannerTitle.htm