SpkIdEvalWav

The SpkIdEvalWav task performs speaker identification on a single audio file.

NOTE:

To process an audio stream, use the SpkIdEvalStream task.

Parameters

Parameter Description Required
Type The task name. Set to SpkIdEvalWav. Yes
AllowEmpty Whether to produce gender labels as output if no speakers are specified.  
ClosedSet Whether the task that you are running is a closed-set test.  
DiagFile The name of the file to write diagnostic information to.  
DiagLevel The level of detail to include in the diagnostic information.  
DiscardShort Exclude segments shorter than a specific duration from further analysis.  
File The audio file to process.  
MinNonSpeech The minimum size in seconds of non-speech segments.  
MinSpeech The minimum size in seconds of speech segments.  
Out The file to write the results to.  
Sfreq The sample frequency of the audio stream to process.  
SugdInputChannels The channel layout of the input media file.  
SugdInputFrequency The sampling rate of the input media file.  

TemplateExt

The file extension to use for template files.  
TemplateList A list file that lists multiple speaker template files to use. Yes, if TemplateSet is not specified
TemplatePath The path to the directory containing the speaker templates.  
TemplateSet An audio template set file. Yes, if TemplateList is not specified
ThreshScale The rate at which to scale the thresholds.  

Example

http://localhost:15000/action=AddTask&Type=SpkIdEvalWav&File=C:\Data\Speech.wav&TemplateSet=speakers.ats&ClosedSet=False&Out=results.ctm

This action uses port 15000 to instruct HPE IDOL Speech Server, which is located on the local machine, to search the Speech.wav file for speakers based on the template set file speakers.ats, and to write the identification results to the results.ctm file.

Because the test is set to be open-set, HPE IDOL Speech Server marks sections where no speaker scores above their respective thresholds as Unknown_.


_HP_HTML5_bannerTitle.htm