IvSpkIdDevelAudio

The IvSpkIdDevelAudio task processes an audio file or stream to generate scores for tuning iVector thresholds. The task generates a single iVector development score file (.ivd). You can append the scores to an existing .ivd file.

You can score the audio against one or more iVector template files (.iv).

For more information on development score files, see the IDOL Speech Server Administration Guide.

Parameters

Parameter Description Required
Type The task name. Set to IvSpkIdDevelAudio. Yes
Append Whether to append speaker scores for each audio file to a common .ivd file.  
DataLabel The name of the speaker that the audio is associated with. Yes
DevFile The name of the audio template development file. Yes
DiagFile The name of the file to write diagnostic information to.  
DiagLevel The level of detail to include in the diagnostic information.  
EndTime The end of an audio section to process.  
File The audio file that contains the speaker example speech. Yes, if InputType is File.
FrameDupl The balance between performance and speed for audio preprocessing DNN classification.  
InputType The type of audio to process (file, binary data, or stream).  
LabFile A single label file to use.  
LabType The type of labels to use.  
MinLen The minimum length of an audio segment.  
Sfreq The sample frequency of the audio file to process.  
StartTime The beginning of an audio section to process.  
SugdInputChannels The channel layout of the input media file. This parameter does not apply when InputType is Stream.  
SugdInputFrequency The sampling rate of the input media file. This parameter does not apply when InputType is Stream.  
TemplateExt The file extension to use for template files.  
TemplateFile A single speaker template file to use to generate scores. See Comments
TemplateList A list file that lists the speaker template files to use to generate scores. See Comments
TemplateName The name associated with the template file. See Comments
TemplatePath The path to the directory that contains the template files. See Comments
Win The size of each audio segment.  

Comments

You must set one of the following parameter combinations:

If you set TemplateList, and your templates are not in the default location (the SpeakerIDDir directory), you must also set TemplatePath.

Examples

http://localhost:15000/action=AddTask&Type=IvSpkIdDevelAudio&InputType=File&File=C:/Data/BrownSpeech4.wav&Label=Brown&TemplateList=ListManager/speakers&DevFile=speakers.ivd

This action uses port 15000 to instruct IDOL Speech Server, which is located on the local machine, to generate match statistics for the speaker named Brown by checking the example speech in BrownSpeech4.wav against the speaker templates specified in the speakers list, and writing the results to the speakers.ivd development file.

http://localhost:15000/action=AddTask&Type=IvSpkIdDevelAudio&InputType=Stream&DataLabel=Brown&TemplateList=ListManager/speakers&DevFile=speakers.ivd

This action uses port 15000 to instruct IDOL Speech Server, which is located on the local machine, to create the Brown.ivd template file for the audio stream.


_HP_HTML5_bannerTitle.htm