If a transcript is available for an audio recording, the TranscriptAlign
task can place time locations for each word in the transcript. This task is suitable for aligning subtitles to audio or video files.
Parameter | Description | Required |
---|---|---|
Type | The task name. Set to TranscriptAlign . |
Yes |
CtmFile | The speech-to-text transcript produced for the audio file. | Yes |
Lang | The language pack to use as a foundation. | |
MatchType | The alignment mode. | Yes |
Out | The file to write the aligned transcript to. | Yes |
ScoreBeam | The score-based beam-width value to use during dynamic programming alignment. | |
TxtFile | The transcript file. | Yes |
WordBeam | The word region-based beam-width value to use during dynamic programming alignment. |
http://localhost:13000/action=AddTask&Type=TranscriptAlign&TxtFile=C:\data\transcript.txt&CtmFile=C:\misc\speechtext.ctm&Out=AlignedTranscript.ctm&MatchType=words
This action uses port 13000
to instruct HPE IDOL Speech Server, which is located on the local machine, to compare the transcript transcript.txt
with the speech-to-text output speechtext.ctm
to produce an aligned transcript, AlignedTranscript.ctm
. The action instructs HPE IDOL Speech Server to use the words
alignment mode.
|