The IvSpkIdDevelStream
task takes a single audio stream, along with the name of the speaker the stream is associated with, and generates scores for tuning iVector thresholds. The task generates a single iVector development score file (.ivd
). You can append the scores to an existing .ivd
file.
You can score the audio against one or more iVector template files (.iv
).
To process audio from a file, use the IvSpkIdDevelWav task.
For more information on development score files, see the HPE IDOL Speech Server Administration Guide.
Parameter | Description | Required |
---|---|---|
Type | The task name. Set to IvSpkIdDevelStream . |
Yes |
Append | Whether to append speaker scores for each audio file to a common .ivd file.
|
|
DataLabel | The name of the speaker that the audio is associated with. | Yes |
DevFile | The name of the iVector development file. | Yes |
DiagFile | The name of the file to write diagnostic information to. | |
DiagLevel | The level of detail to include in the diagnostic information. | |
FrameDupl | The balance between performance and speed for audio preprocessing DNN classification. | |
LabFile | A single label file to use. | |
LabType | The type of labels to use. | |
MinLen | The minimum length of an audio segment. | |
Sfreq | The sample frequency of the audio file to process. | |
TemplateExt | The file extension to use for template files. | |
TemplateFile | A single speaker template file to use to generate scores. | Yes, if TemplateList is not set |
TemplateList | A list file that lists the speaker template files to use to generate scores. | Yes, if TemplateFile is not set |
TemplateName | The name associated with the template file. | Yes, if TemplateFile is set |
TemplatePath | The path to the directory that contains the template files. | Yes, if TemplateList is set and the templates are not in the default location (the SpeakerIDDir directory) |
Win | The size of each audio segment. |
http://localhost:15000/action=AddTask&Type=IvSpkIdDevelStream&DataLabel=Brown&TemplateList=ListManager/speakers&DevFile=speakers.ivd
This action uses port 15000
to instruct HPE IDOL Speech Server, which is located on the local machine, to create the Brown.ivd
template file for the audio stream.
|