You must combine the individual language classifiers into a single language identification set. To identify the language being spoken, HPE IDOL Speech Server compares audio against this set.
To combine the language classifiers into a set
Create a list of classifiers. Use the following format for each list entry:
LanguageName;ClassifierName;LanguageWeight
where:
LanguageName
|
is the name that is reported in the results. |
ClassifierName
|
is the name of the trained classifier file for this language. NOTE: It is not necessary to specify the full path to the classifier in this list, because you can use an action parameter to provide this information when you use the classifier set.
|
LanguageWeight
|
is the weight to apply to the language. HPE IDOL Speech Server uses weights to scale the scores for each language before it compares them. You can leave this field empty before you run the LangIdOptimize task, which generates optimized weights (see Optimize the Language Identification Set). |
Separate the three fields with semi-colons (;).
For more information about HPE IDOL Speech Server's list manager, see Create and Manage Lists.
The following example list contains four language classifiers: English, French, German, and Italian.
English;ENUK.lcf;1.0 French;FRFR.lcf;1.0 German;DEDE.lcf;1.0 Italian;ITIT.lcf;1.0
You can also use the optional $sfreq
parameter at the start of the classifier list to identify the sample frequency for the classifier set.
For example:
$sfreq;16000 ENUK;ENUK.lcf;1.003 ESES;ESES.lcf;0.985
This example sets the sample frequency for this set to 16 kHz, and then lists the two classifiers in the set. In addition, usage is restricted to audio of the specified sample frequency, to prevent accidental mismatches.
|