The AmTrain
task presents training audio and transcription data to the acoustic model training process to create accumulator files. The AmTrainFinal task uses these accumulator files to produce a final adapted acoustic model.
Parameter | Description | Required |
---|---|---|
Type | The task name. Set to AmTrain . |
Yes |
AdaptSil | Whether to adapt the silence model. | |
Am | The acoustic model to adapt. | Yes |
BeamStep | The amount to increase the beam value by on a pass failure, before attempting another pass. | |
DataList | A list of the adaptation files. | Yes |
Diag | Whether to generate diagnostic information. | |
DiagFile | The file to write the diagnostic information to. | |
Junk | Whether to identify words in the adaptation data with poor alignment scores as junk. | |
JunkThresh | The alignment score threshold. Word alignments scoring above this value are labeled as junk. | |
MaxBeam | The maximum beam value at which to attempt the adaptation pass. | |
MinBeam | The minimum beam value at which to attempt the adaptation pass. | |
MLLRMaxMins | Whether to use standard acoustic adaptation or rapid adaptation mode. | |
MLLRMinOcc | When the AmTrain task runs in rapid adaptation mode, the minimum number of times that a basic phoneme (for example, ‘d’) must occur in the adaptation data before an individual phoneme transform can be used for adaptation. |
|
Out | The name of the adaptation accumulator (.acc) file to produce. | Yes |
OutLabExt | The label file extension. | |
OutLabPath | The directory to write label files to. By default, IDOL Speech Server writes the files to the configured temp directory. | |
Pgf | The pronunciation generation (.pgf) file included in the language pack. | Yes |
PlhExt | The file extension of the input audio feature files. | |
PlhPath | The path to the directory containing the acoustic feature (.plh) files specified in the DataList. | Yes |
RelaxRestrain | Relaxes time restraints by a specified number of frames. | |
Restrain | Whether to apply time constraints to the locations of the words in audio during processing. | |
SilRestrain | Whether to apply time constraints to the locations of silence in audio during processing. | |
TxtExt | The file extension of the input transcription files. | |
TxtPath | The path to the directory containing the transcript (.ctm) files specified in the DataList. | Yes |
WriteOutLabs | Whether to create label files. | |
ZeroDurWords | Whether to label zero-duration words as junk. |
http://localhost:13000/action=AddTask&Type=AmTrain&Am=C:\LP\ENUK\ver-ENUK-5.0-16k.am&Pgf=C:\LP\ENUK\ver-ENUK-5.0.pgf&DataList=ListManager/OptList&PlhPath=C:\data\PLH&TxtPath=C:\data\transcripts&Out=AmAcc.acc
This action uses port 13000
to instruct IDOL Speech Server, which is located on the local machine, to produce the AmAcc
accumulator file using the ver-ENUK-5.0-16k
acoustic model, ver-ENUK-5.0
pronunciation generation file, audio feature files stored in C:\data\PLH
, and transcription files stored in C:\data\transcripts
.
|