StreamToText

DEPRECATED:

The StreamToText task is deprecated in IDOL Speech Server version 11.5 and later. Use the SpeechToText task instead.

This task is still available for existing implementations, but it might be incompatible with new functionality. The task might be deleted in future.

The StreamToText task converts live audio into a text transcript.

Parameters

Parameter Description Required
Type The task name. Set to StreamToText. Yes
ClassWordFile The path to a list of new words and weightings to add to the language model at load time.  
Conf Whether to generate word confidence scores.  
CustomLm The custom language model to use.  
Diag Whether to generate diagnostic information.  
DiagFile The file to write the diagnostic information to.  
DnnFile The DNN file to use.  
DnnScale The DNN output acoustic score scaling factor.  
ForceRecompoundOff Whether to prevent recompounding.  
ForceRecompoundOn Whether to force recompounding.  
FrameDupl An integer value which allows for greater time efficiency with only a minimal loss of recognition accuracy.  
Lang The language pack to use. Yes
LatFile The name of the lattice file that contains word hypotheses.  
LatScale The depth of the lattice.  
LatWinSize The size (in seconds) of the lattice output window.  
LatWordFile A list of words to find.  
Mode The algorithm mode for the speech-to-text process.  
ModeValue The value of the parameter associated with the speech-to-text algorithm mode.  
Out The file to write the transcription to. Yes
PronFile A file to use to either replace or add alternative pronunciations of words at language load time.  
Punctuation Whether to add punctuation to the word data.  
SpeedBiasLevel The balance between speed and accuracy in the decoder.  
WordBar Switches on word barring.  
WordBarList The location of a list of words to be barred.  

Example

http://localhost:13000/action=AddTask&Type=StreamToText&Lang=ENUK&Out=Transcript1.ctm

This action uses port 13000 to instruct IDOL Speech Server, which is located on the local machine, to transcribe the audio stream using the ENUK language pack and to write the results to the Transcript1.ctm file.


_HP_HTML5_bannerTitle.htm