SegmentWav

The SegmentWav task segments an audio file by speaker. The task does not require any speaker identification information because HPE IDOL Speech Server estimates speaker changes from changes in the audio.

Parameters

Parameter    Description                                                                 Required
Type         The task name. Set to SegmentWav.                                           Yes
File         The audio file to process.                                                  Yes
MaxSeg       The maximum duration of an audio segment, in seconds.                       No
MinSeg       The minimum duration of an audio segment, in seconds.                       No
NormFile     The acoustic normalization file (.ian file from the language pack) to use.  Yes
Out          The file to write the results to.                                           Yes
Sensitivity  Controls the number of segments that are produced.                          No
DEPRECATED:

The Norm parameter is deprecated for HPE IDOL Speech Server version 10.6.0 and later. Use the NormFile parameter instead.

The Norm parameter is still available for existing implementations, but it might be incompatible with new functionality. The parameter might be removed in a future release.
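
The required parameters and the Norm deprecation described above can be checked before a task is submitted. The following Python sketch is illustrative only. The check_segmentwav_params helper is hypothetical and is not part of HPE IDOL Speech Server. It assumes that parameter names are spelled exactly as in the parameter table, and that an existing Norm value can stand in for NormFile in older implementations.

```python
# Parameters marked as required in the SegmentWav parameter table.
REQUIRED = ("Type", "File", "NormFile", "Out")


def check_segmentwav_params(params):
    """Return (errors, warnings) for a dict of SegmentWav task parameters.

    Hypothetical client-side helper; the server performs its own validation.
    """
    errors = []
    warnings = []

    for name in REQUIRED:
        # Assumption: a legacy Norm value is accepted in place of NormFile.
        if name == "NormFile" and "Norm" in params:
            continue
        if not params.get(name):
            errors.append(f"missing required parameter: {name}")

    # The Type parameter must name this task.
    if params.get("Type") and params["Type"] != "SegmentWav":
        errors.append("Type must be SegmentWav")

    # Norm is deprecated for version 10.6.0 and later.
    if "Norm" in params:
        warnings.append("Norm is deprecated; use NormFile instead")

    return errors, warnings
```

A caller might refuse to submit the task when the errors list is not empty, and log the warnings so that remaining uses of Norm can be migrated to NormFile.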

Example

http://localhost:13000/action=AddTask&Type=SegmentWav&File=C:\Data\Speech.wav&NormFile=C:\LP\ver-ENUK-5.0-16k.ian&Out=SpeakerChanges.ctm

This action sends an AddTask request to HPE IDOL Speech Server on the local machine, using port 13000. The server estimates speaker changes in the Speech.wav file, using the ver-ENUK-5.0-16k.ian normalization file, and writes the results to the SpeakerChanges.ctm file.
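
The example action can also be assembled programmatically. The following Python sketch builds the same AddTask URL from its parts. The build_segmentwav_url function is a hypothetical helper, not a product API. It percent-encodes parameter values (for example, the backslashes and colon in Windows paths), on the assumption that the server accepts URL-encoded values. The sketch only constructs the URL and does not send it.

```python
from urllib.parse import quote


def build_segmentwav_url(host, port, file, norm_file, out, **optional):
    """Build an AddTask action URL for the SegmentWav task.

    Hypothetical helper. Optional task parameters such as MaxSeg, MinSeg,
    or Sensitivity can be passed as keyword arguments.
    """
    params = {
        "Type": "SegmentWav",
        "File": file,
        "NormFile": norm_file,
        "Out": out,
    }
    params.update(optional)

    # Percent-encode every value so that characters such as ':' and '\'
    # cannot be confused with URL syntax.
    query = "&".join(
        f"{name}={quote(str(value), safe='')}" for name, value in params.items()
    )
    return f"http://{host}:{port}/action=AddTask&{query}"


if __name__ == "__main__":
    url = build_segmentwav_url(
        "localhost",
        13000,
        r"C:\Data\Speech.wav",
        r"C:\LP\ver-ENUK-5.0-16k.ian",
        "SpeakerChanges.ctm",
    )
    print(url)
```

Optional parameters can be added in the same call, for example build_segmentwav_url(..., MinSeg=2). The value 2 is only a placeholder for illustration and is not a recommended setting.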

