HPE recommends that you have no more than one HPE IDOL Speech Server for each machine, and that you use the following machine specifications:
Memory requirements depend on the number of language packs, the number of simultaneous decode tasks, and the operating system.
600 MB for each language pack (shared across multiple channels).
HPE recommends that you assign each speech-to-text task to a single 2 GHz core.
HPE IDOL Speech Server has the following memory requirements:
Each language pack requires approximately 500 MB of RAM to load.
Note: Broadband and telephony versions of the same language pack count as separate language packs internally. Similarly, if you load the same language pack without a DNN, with a small DNN (for example, for real-time processing), and with the standard DNN, this counts as three separate language packs.
Each HPE IDOL Speech Server task requires additional memory. HPE recommends the following approximate values. If you run multiple tasks simultaneously, their memory requirements add together; for example, two concurrent tasks that each use 250 MB of memory require 500 MB in total.
Task type | Memory |
---|---|
Speech-to-text | 150 MB (stereo speech-to-text uses two stt modules and therefore uses twice the memory of a standard speech-to-text task) |
Language model building | 300 MB (if the training texts contain unusually large vocabularies, language model building tasks might require more memory) |
Acoustic model adaptation | 750 MB |
Speaker identification | 100 MB |
Transcript alignment | 250 MB |
Language identification | 250 MB |
Language identification training | 300 MB |
Language identification optimization | 10 MB |
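
The arithmetic above can be sketched as a small estimator. This is an illustrative calculation only, not part of HPE IDOL Speech Server: the function name `estimate_memory_mb` and its parameters are hypothetical, the per-task figures are copied from the table, and the default of 500 MB for each language pack is taken from the approximate load figure stated earlier. Remember that broadband, telephony, and different DNN variants of the same language pack each count as a separate language pack.

```python
# Approximate per-task memory in MB, copied from the table above.
TASK_MEMORY_MB = {
    "speech-to-text": 150,
    "language model building": 300,
    "acoustic model adaptation": 750,
    "speaker identification": 100,
    "transcript alignment": 250,
    "language identification": 250,
    "language identification training": 300,
    "language identification optimization": 10,
}


def estimate_memory_mb(language_packs, tasks, stereo_stt_tasks=0,
                       language_pack_mb=500):
    """Return a rough memory estimate in MB (hypothetical helper).

    language_packs   -- number of loaded language packs, counting each
                        broadband/telephony/DNN variant separately
    tasks            -- mapping of task type to number of concurrent tasks
    stereo_stt_tasks -- concurrent stereo speech-to-text tasks, each of
                        which uses twice the memory of a standard task
    language_pack_mb -- assumed memory per language pack (approximate)
    """
    if language_packs < 0 or stereo_stt_tasks < 0:
        raise ValueError("counts must not be negative")

    total = language_packs * language_pack_mb
    for task_type, count in tasks.items():
        if count < 0:
            raise ValueError("counts must not be negative")
        # An unknown task type raises KeyError rather than being guessed.
        total += TASK_MEMORY_MB[task_type] * count

    # Stereo speech-to-text runs two stt modules, so it counts double.
    total += 2 * TASK_MEMORY_MB["speech-to-text"] * stereo_stt_tasks
    return total


if __name__ == "__main__":
    # Three language packs, four mono speech-to-text tasks, and one
    # speaker identification task running at the same time.
    print(estimate_memory_mb(3, {"speech-to-text": 4,
                                 "speaker identification": 1}))
```

The result covers only language packs and tasks; it does not include memory used by the operating system or by the server process itself, so treat it as a lower bound when sizing a machine.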
To ensure smooth operation, the speechserver.cfg configuration file allows you to limit the number of concurrent actions on the server. You must also be careful when you set the maximum number of language models that the server can load.