This task takes a single audio file containing speech data from the speaker to be trained, and creates a new speaker template file.
To create a speaker template from a single audio file
Send an AddTask action to HPE IDOL Speech Server, and set the following parameters:
Type | The task name. Set to SpkIdTrainWav.
File | The audio file that contains sample speech from one person.
Out | The name of the speaker template file to create. You must include the audio template file extension (.atf).
For example:
http://localhost:15000/action=AddTask&Type=SpkIdTrainWav&file=C:/Data/BrownSpeech.wav&out=Brown.atf
This action uses port 15000 to instruct HPE IDOL Speech Server, which is located on the local machine, to create the Brown.atf template file by using the BrownSpeech.wav file.
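The request above can also be assembled programmatically. The following Python sketch only builds the action URL from the three documented parameters; the function name build_train_wav_url and its default host and port are illustrative assumptions, not part of HPE IDOL Speech Server. It does not send the request.

```python
from urllib.parse import urlencode


def build_train_wav_url(audio_file, template_file, host="localhost", port=15000):
    """Build an AddTask URL for the SpkIdTrainWav task.

    Hypothetical helper for illustration: the name and the default
    host/port are assumptions taken from the example in this topic.
    """
    # The Out parameter must include the audio template file extension.
    if not template_file.lower().endswith(".atf"):
        raise ValueError("template file name must end with .atf")

    # Keep ':' and '/' unescaped so that Windows-style paths such as
    # C:/Data/BrownSpeech.wav appear as they do in the documented example.
    query = urlencode(
        {"Type": "SpkIdTrainWav", "file": audio_file, "out": template_file},
        safe=":/",
    )
    return f"http://{host}:{port}/action=AddTask&{query}"


if __name__ == "__main__":
    print(build_train_wav_url("C:/Data/BrownSpeech.wav", "Brown.atf"))
```

With the arguments shown, the helper reproduces the example URL. To submit the action, you could pass the resulting string to an HTTP client of your choice.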
This action returns a token. You can use the token to:
To process streamed audio, use the SpkIdTrainStream task. For more details about this standard task, see the HPE IDOL Speech Server Reference.