The IvSpkIdFeature
task uses an audio file that contains sample speech from one person to create speaker ID feature files for use in iVector based template training, and template score threshold development.
Parameter | Description | Required |
---|---|---|
Type | The task name. Set to IvSpkIdFeature .
|
Yes |
AudioUpsampling | Whether to allow audio upsampling if the input audio has a sample rate too low for the task. | |
EndTime | The end of an audio section to process. | |
FrameDupl | The balance between performance and speed for audio preprocessing DNN classification. | |
File | The audio file that contains sample speech from one person. | Yes, if InputType is File . |
InputType | The type of audio to process (file, binary data, or stream). | |
Out | The name of the speaker ID feature file to create. | Yes |
Sfreq | The sample frequency of the audio file to process. | |
StartTime | The beginning of an audio section to process. | |
SugdInputChannels | The channel layout of the input media file. This parameter does not apply when InputType is Stream . |
|
SugdInputFrequency | The sampling rate of the input media file. This parameter does not apply when InputType is Stream . |
http://localhost:15000/action=AddTask&Type=IvSpkIdFeature&File=C:/Data/BrownSpeech1.wav&out=BrownSpeech1.ivp
This action creates the BrownSpeech1.ivp
feature file using the BrownSpeech1.wav
file.
|