IDOL Speech Server stores phoneme time track information in an .fmd file. You process an individual audio file to create a phoneme time track file. You can then search this file, or combine time track files for multiple audio files into a single file, which you can then search.
To create a phoneme time track file for an audio file
Send an AddTask
action to IDOL Speech Server, and set the following parameters:
Type
|
The task name. Set to CreateFMD . |
File
|
The audio file to process. To restrict processing to a section of the audio file, set the StartTime and EndTime parameters. For more information, see the IDOL Speech Server Reference). |
Lang
|
The phonetic phrase match language pack to use. For more information about available language packs, see Supported Resources. |
Out
|
The name of the phoneme time track file to produce. |
For example:
http://localhost:15000/action=AddTask&Type=CreateFMD&File=C:/myData/Speech.wav&Lang=ENUS-pm&Out=Speech1.fmd
This action processes the Speech.wav
file to produce the Speech1.fmd
time track file, using the ENUS-pm
language pack.
This action returns a token. You can use the token to:
To combine multiple phoneme time track files
Create a list that contains the names of the individual time track files to combine. Each entry in the list must be on a separate line, and must consist of label and file name pairs separated by a semicolon. The label corresponds to the name to give to the file, and the file names correspond to the full path of the files to be combined. For example:
Label1;filename1 Label2;filename2 ...
For more information about IDOL Speech Server's list manager, see Create and Manage Lists.
Send an AddTask
action to IDOL Speech Server, and set the following parameters:
Type
|
The task name. Set to CombineFMD . |
ListFile
|
The list that specifies the individual time track files to combine. |
ListPath
|
The path to the directory that contains the phoneme time track files. |
FileOut
|
The name of the combined phoneme time track file to produce. |
For example:
http://localhost:15000/action=AddTask&Type=CombineFMD&ListFile=ListManager/fmdList&ListPath=C:\PHRASESEARCH\fmd&FileOut=myData.fmd
This action combines the individual phoneme time track files specified in the fmdList
list to produce the single file myData.fmd
.
This action returns a token. You can use the token to:
|