The StreamToText
task is deprecated in IDOL Speech Server version 11.5 and later. Use the SpeechToText task instead.
This task is still available for existing implementations, but it might be incompatible with new functionality. The task might be deleted in future.
The StreamToText
task converts live audio into a text transcript.
Parameter | Description | Required |
---|---|---|
Type | The task name. Set to StreamToText . |
Yes |
ClassWordFile | The path to a list of new words and weightings to add to the language model at load time. | |
Conf | Whether to generate word confidence scores. | |
CustomLm | The custom language model to use. | |
Diag | Whether to generate diagnostic information. | |
DiagFile | The file to write the diagnostic information to. | |
DnnFile | The DNN file to use. | |
DnnScale | The DNN output acoustic score scaling factor. | |
ForceRecompoundOff | Whether to prevent recompounding. | |
ForceRecompoundOn | Whether to force recompounding. | |
FrameDupl | An integer value which allows for greater time efficiency with only a minimal loss of recognition accuracy. | |
Lang | The language pack to use. | Yes |
LatFile | The name of the lattice file that contains word hypotheses. | |
LatScale | The depth of the lattice. | |
LatWinSize | The size (in seconds) of the lattice output window. | |
LatWordFile | A list of words to find. | |
Mode | The algorithm mode for the speech-to-text process. | |
ModeValue | The value of the parameter associated with the speech-to-text algorithm mode. | |
Out | The file to write the transcription to. | Yes |
PronFile | A file to use to either replace or add alternative pronunciations of words at language load time. | |
Punctuation | Whether to add punctuation to the word data. | |
SpeedBiasLevel | The balance between speed and accuracy in the decoder. | |
WordBar | Switches on word barring. | |
WordBarList | The location of a list of words to be barred. |
http://localhost:15000/action=AddTask&Type=StreamToText&Lang=ENUK&Out=Transcript1.ctm
This action transcribes the audio stream using the ENUK
language pack and writes the results to the Transcript1.ctm
file.
|