Overview

HPE IDOL Speech Server allows you to adapt the acoustic models that are available out of the box to more closely match the acoustic properties of particular sets of audio data. Adapting the model using data that closely represents (in terms of recording quality and accents) the audio that you expect to process should improve speech-to-text results.

Adapting an acoustic model involves a series of steps:

  1. Prepare the data set. The data set must include audio and verbatim transcripts of the audio. Preparation of the files involves:

  2. Use the AmTrain task to ingest the audio and transcription data.
  3. Use the AmTrainFinal task to produce the updated acoustic model.
  4. Repeat steps 2 and 3 multiple times, each time using the latest adapted acoustic model.

These steps are covered in detail in the following sections.

NOTE:

For this procedure, your HPE IDOL Speech Server license must include the align module (required for transcription alignment).


_HP_HTML5_bannerTitle.htm