LangIdCumWav

DEPRECATED:

The LangIdCumWav task is deprecated in IDOL Speech Server version 11.5 and later. Use the LangId task instead.

This task is still available for existing implementations, but it might be incompatible with new functionality. The task might be removed in a future version.

The LangIdCumWav task reads data from an audio file and converts it into language identification features. The task returns the running language identification score at periodic intervals. Each score is cumulative: it covers all the processed audio from the start up to the current point.
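The difference between a cumulative score and a per-interval score can be illustrated with a small sketch. This is a conceptual illustration only, not the scoring method that IDOL Speech Server uses: the function name running_scores, the sample values, and the choice of a simple running mean are all assumptions made for the example.

```python
from itertools import accumulate


def running_scores(interval_scores):
    """Return the cumulative mean after each interval.

    Hypothetical illustration: each output value summarizes every
    interval from the start up to that point, not just the latest one.
    """
    totals = accumulate(interval_scores)
    return [total / count for count, total in enumerate(totals, start=1)]


# Made-up per-interval scores for one language candidate.
per_interval = [0.25, 0.75, 0.5]
print(running_scores(per_interval))  # [0.25, 0.5, 0.5]
```

In this sketch, a later value changes more slowly than the per-interval values because it reflects all earlier intervals as well as the current one.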

Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Type | The task type. Set to LangIdCumWav. | Yes |
| AppDnnBase | The location of the appResources directory, which contains the DNN and .ian files to use. | |
| AppFrameDupl | The balance between performance and speed for audio preprocessing DNN classification. | |
| Beam | The beam width of the search process. | |
| ClassList | A list of language classifiers to use. | |
| ClassPath | The path to the directory containing the language classifiers. | |
| ClosedSet | Whether to use closed set or open set language identification. | |
| DiagFile | The file to write the diagnostic information to. | |
| DiagLevel | The level of detail to include in the diagnostic information. | |
| DnnFile | The Deep Neural Network acoustic modeling file to use. | |
| EndTime | The end of an audio section to process. | |
| File | The audio file to process. | Yes |
| FrameDupl | An integer value which allows for greater time efficiency with only a minimal loss of recognition accuracy. | |
| Lang | The name of the language pack to use. | Yes |
| LangList | A subset of languages to use from the classifier list file. | |
| MinRate | The minimum phone rate (phones per second). | |
| NBest | The maximum number of language candidates to include in the output file. | |
| Out | The file to write language identification results to. | Yes |
| SegSize | The maximum results segment size. | |
| SilThresh | The threshold between what the task identifies as silence and non-silence. | |
| SpeechThresh | The threshold between speech and non-speech (music or noise). | |
| StartTime | The beginning of an audio section to process. | |
| SugdInputChannels | The channel layout of the input media file. | |
| SugdInputFrequency | The sampling rate of the input media file. | |

NOTE:

The ClassList parameter is required only if you want to change the audio sample rate, or if you want to use your own custom classifiers. You might also need to specify the ClassPath parameter, depending on the location of the classifier files.

Example

http://localhost:15000/action=AddTask&Type=LangIdCumWav&File=C:\Data\Speech.wav&ClassList=ListManager/OptClassSet&ClassPath=C:\LangID\&Out=SpeechLang4.ctm

This action identifies the language in the Speech.wav file using the language classifiers specified in the OptClassSet list. IDOL Speech Server writes the identification results to the SpeechLang4.ctm file.
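If you build the action URL in a script rather than by hand, a sketch such as the following might help. It only assembles the URL string and does not contact the server. The helper name build_add_task_url is hypothetical, and percent-encoding the parameter values is an assumption about what your server accepts; check it against your own configuration.

```python
from urllib.parse import quote


def build_add_task_url(host, port, params):
    """Assemble an AddTask action URL from a dict of task parameters.

    Hypothetical helper: values are percent-encoded so that characters
    such as ':' and '\\' in Windows paths are safe inside a URL.
    """
    query = "&".join(
        f"{name}={quote(str(value), safe='')}" for name, value in params.items()
    )
    return f"http://{host}:{port}/action=AddTask&{query}"


# The same parameters as the example action above.
url = build_add_task_url(
    "localhost",
    15000,
    {
        "Type": "LangIdCumWav",
        "File": r"C:\Data\Speech.wav",
        "ClassList": "ListManager/OptClassSet",
        "ClassPath": "C:\\LangID\\",
        "Out": "SpeechLang4.ctm",
    },
)
print(url)
```

Other parameters from the table, such as Lang, can be added to the dictionary in the same way.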

