IvSpkIdFeature

The IvSpkIdFeature task uses an audio file that contains sample speech from one person to create speaker ID feature files for use in iVector-based template training and template score threshold development.

Parameters

Parameter | Description | Required
Type | The task name. Set to IvSpkIdFeature. | Yes
AudioUpsampling | Whether to allow audio upsampling if the input audio has a sample rate too low for the task. |
EndTime | The end of an audio section to process. |
FrameDupl | The balance between performance and speed for audio preprocessing DNN classification. |
File | The audio file that contains sample speech from one person. | Yes, if InputType is File.
InputType | The type of audio to process (file, binary data, or stream). |
Out | The name of the speaker ID feature file to create. | Yes
Sfreq | The sample frequency of the audio file to process. |
StartTime | The beginning of an audio section to process. |
SugdInputChannels | The channel layout of the input media file. This parameter does not apply when InputType is Stream. |
SugdInputFrequency | The sampling rate of the input media file. This parameter does not apply when InputType is Stream. |

Example

http://localhost:15000/action=AddTask&Type=IvSpkIdFeature&File=C:/Data/BrownSpeech1.wav&Out=BrownSpeech1.ivp

This action creates the BrownSpeech1.ivp feature file using the BrownSpeech1.wav file.
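
When many feature files need to be created, the action URL can be assembled programmatically. The following is a minimal sketch that only builds the URL string and does not contact the server. The function name build_ivspkid_feature_url is hypothetical, and the host and port defaults are taken from the example above; the parameter names come from the parameter table.

```python
from urllib.parse import urlencode


def build_ivspkid_feature_url(audio_file, out_file, host="localhost", port=15000):
    """Build an AddTask action URL for the IvSpkIdFeature task.

    Hypothetical helper: host and port default to the values in the
    example above and may differ in a real deployment.
    """
    params = {
        "Type": "IvSpkIdFeature",  # task name (required)
        "File": audio_file,        # audio with speech from one person
        "Out": out_file,           # speaker ID feature file to create (required)
    }
    # Keep ':' and '/' unescaped so that the path reads as in the example;
    # other reserved characters are percent-encoded.
    query = urlencode(params, safe=":/")
    return f"http://{host}:{port}/action=AddTask&{query}"


url = build_ivspkid_feature_url("C:/Data/BrownSpeech1.wav", "BrownSpeech1.ivp")
print(url)
```

The resulting string can then be sent with any HTTP client. Optional parameters such as StartTime and EndTime could be added to the params dictionary in the same way.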

