The SpkIdTrainWav
task is deprecated in IDOL Speech Server version 11.4 and later. Use the IvSpkIdTrainAudio task instead.
This task is still available for existing implementations, but it might be incompatible with new functionality. The task might be deleted in future.
The SpkIdTrainWav
task takes a single audio file containing speech data from the speaker to be trained, and creates a new speaker template file.
To process streamed audio, use the SpkIdTrainStream task.
Parameter | Description | Required |
---|---|---|
Type | The task name. Set to SpkIdTrainWav .
|
Yes |
DiagFile | The file to write the diagnostic information to. | |
DiagLevel | The level of detail to include in the diagnostic information. | |
File | The audio file that contains sample speech from one person. | Yes |
LabExt | The file extension to use for label files. | Yes, if your files have an extension other than the default (.lab )
|
LabFile | A single label file to use. | |
LabPath | The path to the label files. | Yes, if you have enabled labeling and are specifying a list of multiple files to use |
LabType | The type of labels to use. | |
Out | The name of the speaker template file to create. You must include the audio template file extension (
.atf ). |
Yes |
Sfreq | The sample frequency of the audio file to process. | |
SugdInputChannels | The channel layout of the input media file. | |
SugdInputFrequency | The sampling rate of the input media file. | |
UBMFile | The Universal Background Model to use. |
http://localhost:15000/action=AddTask&Type=SpkIdTrainWav&File=C:/Data/BrownSpeech.wav&Out=Brown.atf
This action uses port 15000
to instruct IDOL Speech Server, which is located on the local machine, to create the Brown.atf
template file by using the BrownSpeech.wav
file.
|