The StreamSidTrain
task is deprecated for HPE IDOL Server version 11.3. Use the SpkIdTrainStream
task instead.
This task is still available for existing implementations, but it might be incompatible with new functionality. The task might be deleted in future.
The StreamSidTrain
task receives sample audio data for a specific speaker from a binary stream, and creates a speaker model to represent this speaker.
Aim to use a minimum of five minutes of speech for each speaker.
Parameter | Description | Required |
---|---|---|
Type | The task name. Set to StreamSidTrain . |
Yes |
Ast | The speaker classifier file. | See Comments. |
Diag | Whether to generate diagnostic information. | |
DiagFile | The file to write the diagnostic information to. | |
MinFrames | The minimum number of speech audio frames required to train each component of a speaker model. | |
NewModel | The speaker template file to create. | Yes |
NMix | The number of components to create in the speaker model. | |
Rel | The relevance to give to USM model parameters during adaptation. | |
Sfreq | The sample frequency of the audio file to process. | |
SidBase | The sid base pack resource to use to determine the base files to use. | |
Sig | The .sig file to use for speaker identification. | |
USM | The USM file to use. | |
USMEnabled | Whether to use the USM as a base for speaker training. |
http://localhost:13000/action=AddTask&Type=StreamSidTrain&NewModel=Bob.spk
This action uses port 13000
to instruct HPE IDOL Speech Server, which is located on the local machine, to create the Bob.spk
template using the sample data from the audio stream.
If you do not specify the Ast parameter, the action uses the base ast file, determined by the SidBase resource. This base file does not contain any speaker information, and cannot identify speakers, but it performs gender detection and speaker segmentation.
|