The SpkIdTrainWav task takes a single audio file containing speech data from the speaker to be trained, and creates a new speaker template file.
Note: To process streamed audio, use the SpkIdTrainStream task.
Parameter | Description | Required |
---|---|---|
Type | The task name. Set to SpkIdTrainWav. | Yes |
DiagFile | The file to write the diagnostic information to. | |
DiagLevel | The level of detail to include in the diagnostic information. | |
File | The audio file that contains sample speech from one person. | Yes |
LabExt | The file extension to use for label files. | Yes, if your files have an extension other than the default (.lab) |
LabFile | A single label file to use. | |
LabPath | The path to the label files. | Yes, if you have enabled labeling and are specifying a list of multiple files to use |
LabType | The type of labels to use. | |
Out | The name of the speaker template file to create. You must include the audio template file extension (.atf). | Yes |
Sfreq | The sample frequency of the audio file to process. | |
SugdInputChannels | The channel layout of the input media file. | |
SugdInputFrequency | The sampling rate of the input media file. | |
UBMFile | The Universal Background Model to use. | |

For example:

http://localhost:15000/action=AddTask&Type=SpkIdTrainWav&File=C:/Data/BrownSpeech.wav&Out=Brown.atf

This action uses port 15000 to instruct HPE IDOL Speech Server, which is located on the local machine, to create the Brown.atf template file by using the BrownSpeech.wav file.
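
The example request above can also be assembled programmatically. The following is a minimal Python sketch that uses only the standard library. The helper names `build_train_wav_url` and `send_task` are hypothetical and are not part of IDOL Speech Server. The sketch sets only the three required parameters from the table (Type, File, and Out), and it returns the server response as raw text because the response format is not described in this section.

```python
from urllib.parse import urlencode
from urllib.request import urlopen


def build_train_wav_url(host, port, audio_file, out_template):
    """Build an AddTask URL for the SpkIdTrainWav task (hypothetical helper)."""
    # The Out parameter must include the .atf audio template file extension.
    if not out_template.lower().endswith(".atf"):
        raise ValueError("Out must end with the .atf extension")
    params = {
        "Type": "SpkIdTrainWav",
        "File": audio_file,
        "Out": out_template,
    }
    # Keep ':' and '/' unescaped so that paths read as in the documented example.
    query = urlencode(params, safe=":/")
    return f"http://{host}:{port}/action=AddTask&{query}"


def send_task(url, timeout=30):
    """Send the request and return the raw response text (format not assumed)."""
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


if __name__ == "__main__":
    # Prints the URL only; call send_task(url) against a running server to submit it.
    print(build_train_wav_url("localhost", 15000,
                              "C:/Data/BrownSpeech.wav", "Brown.atf"))
```

Checking the .atf extension before sending is a design choice in this sketch: it catches a missing extension on the client side rather than relying on the server to reject the task.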