Use Live Mode for Streaming

When you perform speech-to-text conversion on a live audio stream, you can specify a mode that defines the rate at which analysis is performed. HPE IDOL Speech Server version 10.8 and later, together with version 6.0 and later of the language packs, use DNN acoustic models to improve speech-to-text accuracy. Each language pack contains at least two DNN acoustic models of different sizes. By default, fixed mode uses the largest, most accurate model.

To override the default option, specify a different DNN file as the value of the DnnFile parameter in the task configuration file or at the command line.

CAUTION:

You can use DNN acoustic modelling in live or relative mode only if your DNN files are smaller than a certain size. In addition, you must use Intel (or compatible) processors that support the SSSE3 and SSE4.1 SIMD extensions. If you cannot meet these requirements, you can set the DnnFile parameter to none to run non-DNN speech-to-text, which does not have these hardware limitations.
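As a rough aid for the processor requirement above, the following Python sketch checks whether the SSSE3 and SSE4.1 extensions are reported on a Linux host. It is not part of Speech Server. It assumes a Linux system that exposes /proc/cpuinfo, where these extensions appear as the flags ssse3 and sse4_1. The function name missing_simd_flags is hypothetical.

```python
# Assumption: Linux host, where /proc/cpuinfo lists CPU features on "flags" lines
# and names SSSE3 and SSE4.1 as "ssse3" and "sse4_1".
REQUIRED_FLAGS = {"ssse3", "sse4_1"}


def missing_simd_flags(cpuinfo_text):
    """Return the required flags that no 'flags' line in cpuinfo_text lists."""
    present = set()
    for line in cpuinfo_text.splitlines():
        key, _, value = line.partition(":")
        if key.strip() == "flags":
            present.update(value.split())
    return REQUIRED_FLAGS - present


if __name__ == "__main__":
    try:
        with open("/proc/cpuinfo") as handle:
            missing = missing_simd_flags(handle.read())
    except OSError:
        print("Could not read /proc/cpuinfo; check the processor manually.")
    else:
        if missing:
            # The documentation suggests DnnFile=none when the hardware
            # requirements cannot be met.
            print("Missing:", ", ".join(sorted(missing)), "- consider DnnFile=none")
        else:
            print("SSSE3 and SSE4.1 are both reported.")
```

This check covers only the processor requirement. It does not say whether a given DNN file is small enough for live or relative mode.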

To use live mode in live stream speech-to-text tasks, you must add the Mode configuration parameter to the configuration sections for the stt and stream modules, if it is not already present. For example:

[stt]
Mode=$params.Mode
[stream]
Mode=$params.Mode
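To confirm that both sections already contain the Mode parameter before you edit a task configuration, a check along the following lines may help. This is a minimal Python sketch, not a Speech Server tool. It assumes the configuration text can be parsed as plain INI and that the section names are written in lowercase, as in the example above. The function name sections_missing_mode is hypothetical.

```python
import configparser

# Sections that the documentation says must define Mode for live mode.
REQUIRED_SECTIONS = ("stt", "stream")


def sections_missing_mode(config_text):
    """Return the required sections that do not define a Mode parameter."""
    # interpolation=None leaves values such as $params.Mode untouched;
    # strict=False tolerates repeated sections or options.
    parser = configparser.ConfigParser(interpolation=None, strict=False)
    parser.read_string(config_text)
    # has_option is False when the section itself is absent.
    # Option names are matched case-insensitively; section names are not.
    return [name for name in REQUIRED_SECTIONS
            if not parser.has_option(name, "Mode")]


if __name__ == "__main__":
    example = "[stt]\nMode=$params.Mode\n[stream]\nMode=$params.Mode\n"
    print(sections_missing_mode(example))
```

An empty list means that both sections define Mode. Real task configuration files may use syntax that configparser does not accept, in which case the parse step raises an error.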

Setting Mode=$params.Mode in both sections creates a Mode action parameter. To use live mode, set this action parameter to live in a task action that uses the stt and stream modules, such as StreamToText. For example:

http://localhost:13000/action=AddTask&Type=StreamToText&Lang=ENUK&Out=Transcript1.ctm&Mode=Live
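If you build such actions from a script, the URL above can be assembled as in the following Python sketch. It only constructs the string and does not send a request. The function name build_add_task_url is hypothetical, and the host, port, and parameter values are taken from the example above.

```python
from urllib.parse import urlencode


def build_add_task_url(host, port, **params):
    """Build an AddTask action URL in the same form as the example above."""
    # The action name comes first, followed by the task parameters
    # in the order in which they are passed.
    query = urlencode({"action": "AddTask", **params})
    return f"http://{host}:{port}/{query}"


if __name__ == "__main__":
    print(build_add_task_url(
        "localhost", 13000,
        Type="StreamToText", Lang="ENUK", Out="Transcript1.ctm", Mode="Live",
    ))
```

Because urlencode percent-encodes reserved characters, values such as file paths that contain spaces are escaped in the resulting URL.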
