The AfptAddTrackWav
task allows you to add a new audio track to an audio fingerprint database, reading the data from an audio file and converting it into AFP features before indexing. If the database does not already exist, it is created by adding the track.
The AfptAddTrackWav
task uses templates as opposed to landmarks, which improves robustness to audio mismatches at the cost of scaleability.
Parameter | Description | Required |
---|---|---|
Type | The task name. Set to AfptAddTrackWav . |
Yes |
AfpDb | The audio fingerprint database (if it is defined in the configuration file). | * |
File | The name of the audio file to index. | Yes |
Pack | The audio fingerprint database (if it is not defined in the configuration file). | * |
PackDir |
The path to the directory containing the database. If this directory does not already exist, manually create it before sending the action. If the database does not yet exist, HPE IDOL Speech Server creates the database in this directory. |
* |
Rnd | A window size within which a random offset shift is calculated. | |
Sfreq | The sample frequency of the audio file to process. | |
Tag | The tag associated with the track being added. | Yes |
The database type for template-based audio fingerprinting is fptdb
(as opposed to fpdb
for landmark-based audio fingerprinting).
http://localhost:13000/action=AddTask&Type=AfptAddTrackWav&File=C:\Data\Jingle.wav&Tag=MyCompanyJingle&PackDir=C:\resources&Pack=Adverts
This action uses port 13000
to instruct HPE IDOL Speech Server, which is located on the local machine, to use template-based matching to index the audio file Jingle.wav
into the Adverts
database. If the Adverts
database does not already exist, HPE IDOL Speech Server creates it. If the clip is subsequently identified in an audio file, HPE IDOL Speech Server identifies it using the tag MyCompanyJingle
.