The LangIdCumStream
task is deprecated in IDOL Speech Server version 11.5 and later. Use the LangId task instead.
This task is still available for existing implementations, but it might be incompatible with new functionality. The task might be deleted in future.
The LangIdCumStream
task receives audio data as a binary stream and converts it into language identification features. The task returns the running language identification score at periodic intervals. This is the score for all the data from the start to the current point.
Parameter | Description | Required |
---|---|---|
Type | The task name. Set to LangIdCumStream . |
Yes |
AppDnnBase | The location of the appResources directory, which contains the DNN and .ian files to use. |
|
AppFrameDupl | The balance between performance and speed for audio preprocessing DNN classification. | |
Beam | The beam width of the search process. | |
ClassList | A list of language classifiers to use. | |
ClassPath | The path to the directory containing the language classifiers. | |
ClosedSet | Whether to use closed set or open set language identification. | |
DiagFile | The file to write the diagnostic information to. | |
DiagLevel | The level of detail to include in the diagnostic information. | |
DnnFile | The Deep Neural Network acoustic modeling file to use. | |
FrameDupl | An integer value which allows for greater time efficiency with only a minimal loss of recognition accuracy. | |
Lang | The name of the language pack to use. | Yes |
LangList | A subset of languages to use from the classifier list file. | |
MinRate | The minimum phone rate (phones per second). | |
NBest | The maximum number of language candidates to include in the output file. | |
Out | The file to write language identification results to. | Yes |
SegSize | The maximum results segment size. | |
SilThresh | The threshold between what the task identifies as silence and non-silence. | |
SpeechThresh | The threshold between speech and non-speech (music or noise). |
The ClassList
parameter is required only if you want to change the audio sample rate, or if you want to use your own custom classifiers. You might also need to specify the ClassPath
parameter, depending on the location of the classifier files.
http://localhost:15000/action=AddTask&Type=LangIdCumStream&ClassList=ListManager/OptClassSet&ClassPath=C:\LangID\&Out=SpeechLang3.ctm
This action identifies the language in the audio stream using the language classifiers specified in the OptClassSet
list. IDOL Speech Server writes the identification results to the SpeechLang3.ctm
file.
|