The IvSpkIdTrainStream
task is deprecated in IDOL Speech Server version 11.5 and later. Use the IvSpkIdTrainAudio task instead.
This task is still available for existing implementations, but it might be incompatible with new functionality. The task might be deleted in future.
The IvSpkIdTrainStream
task takes a single audio stream containing speech data from the speaker to be trained, and creates a new iVector speaker template file.
To process audio from a file, use the IvSpkIdTrainWav task.
Parameter | Description | Required |
---|---|---|
Type | The task name. Set to IvSpkIdTrainStream . |
Yes |
DiagFile | The file to write the diagnostic information to. | |
DiagLevel | The level of detail to include in the diagnostic information. | |
FrameDupl | An integer value which allows for greater time efficiency without significant change in recognition accuracy. | |
LabFile | A single label file to use. | |
LabType | The type of labels to use. | |
Out | The name of the speaker template file to create. You must include the
audio template
file extension
(
.iv )
. |
Yes |
Sfreq | The sample frequency of the audio file to process. |
http://localhost:15000/action=AddTask&Type=IvSpkIdTrainStream&Out=Brown.iv
This action creates the Brown.iv
template file for the audio stream.
|