The AmTrain task is deprecated in IDOL Speech Server version 11.5 and later. Micro Focus recommends that you use the newer Deep Neural Network (DNN) techniques, rather than GMM-based acoustic models.
This task is still available for existing implementations, but it might be incompatible with new functionality. The task might be removed in a future release.
The AmTrain task presents training audio and transcription data to the acoustic model training process to create accumulator files. The AmTrainFinal task uses these accumulator files to produce a final adapted acoustic model.
Parameter | Description | Required |
---|---|---|
Type | The task name. Set to AmTrain. | Yes |
AdaptSil | Whether to adapt the silence model. | |
Am | The acoustic model to adapt. | Yes |
BeamStep | The amount to increase the beam value by on a pass failure, before attempting another pass. | |
DataList | A list of the adaptation files. | Yes |
Diag | Whether to generate diagnostic information. | |
DiagFile | The file to write the diagnostic information to. | |
Junk | Whether to identify words in the adaptation data with poor alignment scores as junk. | |
JunkThresh | The alignment score threshold. Word alignments scoring above this value are labeled as junk. | |
MaxBeam | The maximum beam value at which to attempt the adaptation pass. | |
MinBeam | The minimum beam value at which to attempt the adaptation pass. | |
MLLRMaxMins | Whether to use standard acoustic adaptation or rapid adaptation mode. | |
MLLRMinOcc | When the AmTrain task runs in rapid adaptation mode, the minimum number of times that a basic phoneme (for example, ‘d’) must occur in the adaptation data before an individual phoneme transform can be used for adaptation. | |
Out | The name of the adaptation accumulator (.acc) file to produce. | Yes |
OutLabExt | The label file extension. | |
OutLabPath | The directory to write label files to. By default, IDOL Speech Server writes the files to the configured temp directory. | |
Pgf | The pronunciation generation (.pgf) file included in the language pack. | Yes |
PlhExt | The file extension of the input audio feature files. | |
PlhPath | The path to the directory containing the acoustic feature (.plh) files specified in the DataList. | Yes |
RelaxRestrain | Relaxes time restraints by a specified number of frames. | |
Restrain | Whether to apply time constraints to the locations of the words in audio during processing. | |
SilRestrain | Whether to apply time constraints to the locations of silence in audio during processing. | |
TxtExt | The file extension of the input transcription files. | |
TxtPath | The path to the directory containing the transcript (.ctm) files specified in the DataList. | Yes |
WriteOutLabs | Whether to create label files. | |
ZeroDurWords | Whether to label zero-duration words as junk. | |
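
The Required column above can be turned into a quick pre-flight check before a task is submitted. The following Python sketch is illustrative only: the helper name `missing_amtrain_params` and the constant `REQUIRED_AMTRAIN_PARAMS` are hypothetical and not part of IDOL Speech Server. It lists the parameters marked "Yes" in the table and reports any that are absent from a parameter dictionary.

```python
# Parameters marked "Yes" in the Required column of the AmTrain table.
# The names of this constant and helper are hypothetical, for illustration only.
REQUIRED_AMTRAIN_PARAMS = ("Type", "Am", "DataList", "Out", "Pgf", "PlhPath", "TxtPath")


def missing_amtrain_params(params):
    """Return the required AmTrain parameters that are absent or empty."""
    return [name for name in REQUIRED_AMTRAIN_PARAMS if not params.get(name)]


# Example parameter set that deliberately omits Out.
example = {
    "Type": "AmTrain",
    "Am": r"C:\LP\ENUK\ver-ENUK-5.0-16k.am",
    "Pgf": r"C:\LP\ENUK\ver-ENUK-5.0.pgf",
    "DataList": "ListManager/OptList",
    "PlhPath": r"C:\data\PLH",
    "TxtPath": r"C:\data\transcripts",
}

print(missing_amtrain_params(example))  # prints ['Out']
```

A check like this only confirms that the required names are present; it does not validate the values or the optional parameters.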
http://localhost:15000/action=AddTask&Type=AmTrain&Am=C:\LP\ENUK\ver-ENUK-5.0-16k.am&Pgf=C:\LP\ENUK\ver-ENUK-5.0.pgf&DataList=ListManager/OptList&PlhPath=C:\data\PLH&TxtPath=C:\data\transcripts&Out=AmAcc.acc
This action produces the AmAcc accumulator file using the ver-ENUK-5.0-16k acoustic model, the ver-ENUK-5.0 pronunciation generation file, audio feature files stored in C:\data\PLH, and transcription files stored in C:\data\transcripts.
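
For scripted use, the same action string can be assembled from a dictionary of parameters. The sketch below is a minimal illustration, not an official client: the function name `build_amtrain_url` is hypothetical, and the host, port, and URL layout are copied from the example above. It leaves `:`, `\`, and `/` unencoded so that the output mirrors the example; whether your server or HTTP client needs these characters percent-encoded is an assumption you should verify. The code only builds the string and does not send a request.

```python
from urllib.parse import urlencode


def build_amtrain_url(params, host="localhost", port=15000):
    """Build an AddTask action URL for the AmTrain task (hypothetical helper).

    The characters ':', '\\' and '/' are kept unencoded to mirror the
    documented example; other reserved characters are percent-encoded.
    """
    query = urlencode({"Type": "AmTrain", **params}, safe=":\\/")
    return f"http://{host}:{port}/action=AddTask&{query}"


url = build_amtrain_url({
    "Am": r"C:\LP\ENUK\ver-ENUK-5.0-16k.am",
    "Pgf": r"C:\LP\ENUK\ver-ENUK-5.0.pgf",
    "DataList": "ListManager/OptList",
    "PlhPath": r"C:\data\PLH",
    "TxtPath": r"C:\data\transcripts",
    "Out": "AmAcc.acc",
})
print(url)
```

Keeping the parameters in a dictionary makes it straightforward to add optional settings such as Diag or MinBeam without editing a long URL by hand.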