The IvSpkIdEvalWav
task is deprecated in IDOL Speech Server version 11.5 and later. Use the IvSpkId task instead.
This task is still available for existing implementations, but it might be incompatible with new functionality. The task might be deleted in future.
The IvSpkIdEvalWav
task performs iVector-based speaker identification on a single audio file.
To process an audio stream, use the IvSpkIdEvalStream task.
Parameter | Description | Required |
---|---|---|
Type | The task name. Set to IvSpkIdEvalWav . |
Yes |
AllowEmpty | Whether to produce gender labels as output if no speakers are specified. | |
DiagFile | The name of the file to write diagnostic information to. | |
DiagLevel | The level of detail to include in the diagnostic information. | |
DiscardShort | Exclude segments shorter than a specific duration from further analysis. | |
File | The audio file to process. | |
FrameDupl | The balance between performance and speed for audio preprocessing DNN classification. | |
MaxNonSpeech | The maximum length of non-speech segments. | |
MaxSpeech | The maximum length of speech segments. | |
MinNonSpeech | The minimum length of non-speech segments. | |
MinSpeech | The minimum length of speech segments. | |
Out | The file to write the results to. | |
Sfreq | The sample frequency of the audio stream to process. | |
SugdInputChannels | The channel layout of the input media file. | |
SugdInputFrequency | The sampling rate of the input media file. | |
The file extension to use for template files. | ||
TemplateList | A list file that lists multiple speaker template files to use. | |
TemplatePath | The path to the directory containing the speaker templates. | |
TemplateSet | An audio template set file. | |
ThreshScale | The rate at which to scale the thresholds. |
http://localhost:15000/action=AddTask&Type=IvSpkIdEvalWav&File=C:\Data\Speech.wav&TemplateSet=speakers.ivs&ClosedSet=False&Out=results.ctm
This action uses port 15000
to instruct IDOL Speech Server, which is located on the local machine, to search the Speech.wav
file for speakers based on the template set file speakers.ivs
, and to write the identification results to the results.ctm
file.
|