IvSpkIdEvalStream

DEPRECATED:

The IvSpkIdEvalStream task is deprecated in IDOL Speech Server version 11.5 and later. Use the IvSpkId task instead.

This task is still available for existing implementations, but it might be incompatible with new functionality. The task might be removed in a future release.

After you have trained a set of speaker templates, use the IvSpkIdEvalStream task to run iVector-based identification of any sections of an audio stream where the trained speakers are present.

NOTE:

To process an audio file, use the IvSpkIdEvalWav task.

Parameters

Parameter Description Required
Type The task name. Set to IvSpkIdEvalStream. Yes
AllowEmpty Whether to produce gender labels as output if no speakers are specified.  
DiagFile The file to write the diagnostic information to.  
DiagLevel The level of detail to include in the diagnostic information.  
DiscardShort Exclude segments shorter than a specific duration from further analysis.  
FrameDupl The balance between performance and speed for audio preprocessing DNN classification.  
MaxNonSpeech The maximum length of non-speech segments.  
MaxSpeech The maximum length of speech segments.  
MinNonSpeech The minimum length of non-speech segments.  
MinSpeech The minimum length of speech segments.  
Out The file to write the results to.  
Sfreq The sample frequency of the audio stream to process.  
TemplateExt The file extension to use for template files.  
TemplateList A list file that lists multiple speaker template files to use.  
TemplatePath The path to the directory containing the speaker templates.  
TemplateSet An audio template set file.  
ThreshScale The rate at which to scale the thresholds.  

Example

http://localhost:15000/action=AddTask&Type=IvSpkIdEvalStream&File=C:\Data\Speech.wav&TemplateSet=speakers.ivs&Out=results.ctm

This action searches the audio stream for speakers based on the iVector-based template set file speakers.ivs, and writes the identification results to the results.ctm file.
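As a minimal sketch, the example action can also be assembled programmatically. The host, port, and parameter values below are taken from the example; the helper name build_add_task_url is hypothetical, and the snippet only builds the URL string without sending a request. It assumes that the server accepts percent-encoded parameter values, as is usual for HTTP query strings.

```python
from urllib.parse import urlencode


def build_add_task_url(host, port, task_params):
    """Return an AddTask action URL in the same shape as the example above.

    Hypothetical helper for illustration; not part of any product API.
    """
    # urlencode percent-encodes reserved characters such as ':' and '\'
    # in the Windows file path, so the value can be sent safely in a URL.
    query = urlencode({"action": "AddTask", **task_params})
    return f"http://{host}:{port}/{query}"


url = build_add_task_url(
    "localhost",
    15000,
    {
        "Type": "IvSpkIdEvalStream",
        "File": r"C:\Data\Speech.wav",
        "TemplateSet": "speakers.ivs",
        "Out": "results.ctm",
    },
)
print(url)
```

Keeping the task parameters in a dictionary makes it straightforward to add optional parameters from the table above, such as ThreshScale or MinSpeech, without hand-editing the URL.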

