The WavToPlh task is deprecated in IDOL Speech Server version 11.5 and later. HPE recommends that you use the newer Deep Neural Network (DNN) techniques, rather than GMM-based acoustic models.
This task is still available for existing implementations, but it might be incompatible with new functionality. The task might be removed in a future release.
The WavToPlh task creates an audio feature (.plh) file from an audio file. The AmTrain task uses audio feature files when adapting acoustic models.
Parameter | Description | Required |
---|---|---|
Type | The task name. Set to WavToPlh. | Yes |
File | The audio file to process. | Yes |
NormFile | The acoustic normalization file (.ian file in the language pack) to use. | Yes |
Out | The feature file to create. | Yes |
Sfreq | The sample frequency of the audio file to process. | Only if the frequency of the audio file is not 16,000 Hz |
SugdInputChannels | The channel layout of the input media file. | |
SugdInputFrequency | The sampling rate of the input media file. | |
http://localhost:13000/action=AddTask&Type=WavToPlh&File=C:\myData\Speech.wav&NormFile=C:\SPKID\SPKID16k.ian&Out=Speech1.plh
This action uses port 13000 to instruct IDOL Speech Server, which is located on the local machine, to produce the Speech1.plh file from the Speech.wav audio file, using the SPKID16k.ian normalization file.
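The example action above can also be assembled programmatically. The following is a minimal sketch rather than an official client: the helper name `build_wavtoplh_url` is hypothetical, and percent-encoding the parameter values (so that characters such as `:` and `\` in Windows paths are safe in a URL) is an assumption about how you would want to send them. The sketch only builds the action string and does not contact the server.

```python
from urllib.parse import quote


def build_wavtoplh_url(file, norm_file, out, host="localhost", port=13000, sfreq=None):
    """Build an AddTask action URL for the WavToPlh task.

    Hypothetical helper for illustration only; it is not part of IDOL Speech Server.
    """
    # Required parameters, in the order used by the example action.
    params = [
        ("Type", "WavToPlh"),
        ("File", file),
        ("NormFile", norm_file),
        ("Out", out),
    ]
    # Sfreq is needed only when the audio is not sampled at 16,000 Hz.
    if sfreq is not None and sfreq != 16000:
        params.append(("Sfreq", str(sfreq)))
    # Assumption: percent-encode every value so paths survive URL transport.
    query = "&".join(f"{name}={quote(value, safe='')}" for name, value in params)
    return f"http://{host}:{port}/action=AddTask&{query}"


if __name__ == "__main__":
    print(build_wavtoplh_url(r"C:\myData\Speech.wav",
                             r"C:\SPKID\SPKID16k.ian",
                             "Speech1.plh"))
```

Passing `sfreq=8000`, for example, appends `Sfreq=8000` to the action, while omitting it (or passing 16000) leaves the parameter out, which matches the Required column in the table.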