TranscriptAlign

If a transcript is available for an audio recording, the TranscriptAlign task can place time locations for each word in the transcript. This task is suitable for aligning subtitles to audio or video files.

Parameters

Parameter Description Required
Type The task name. Set to TranscriptAlign. Yes
CtmFile The speech-to-text transcript produced for the audio file. Yes
Lang The language pack to use as a foundation.  
MatchType The alignment mode. Yes
Out The file to write the aligned transcript to. Yes
ScoreBeam The score-based beam-width value to use during dynamic programming alignment.  
TxtFile The transcript file. Yes
WordBeam The word region-based beam-width value to use during dynamic programming alignment.  

Example

http://localhost:15000/action=AddTask&Type=TranscriptAlign&TxtFile=C:\data\transcript.txt&CtmFile=C:\misc\speechtext.ctm&Out=AlignedTranscript.ctm&MatchType=words

This action compares the transcript transcript.txt with the speech-to-text output speechtext.ctm to produce an aligned transcript, AlignedTranscript.ctm. IDOL Speech Server uses the words alignment mode.


_FT_HTML5_bannerTitle.htm