If a transcript is available for an audio recording, the TranscriptAlign
task can place time locations for each word in the transcript. This task is suitable for aligning subtitles to audio or video files.
Parameter | Description | Required |
---|---|---|
Type | The task name. Set to TranscriptAlign . |
Yes |
CtmFile | The speech-to-text transcript produced for the audio file. | Yes |
Lang | The language pack to use as a foundation. | |
MatchType | The alignment mode. | Yes |
Out | The file to write the aligned transcript to. | Yes |
ScoreBeam | The score-based beam-width value to use during dynamic programming alignment. | |
TxtFile | The transcript file. | Yes |
WordBeam | The word region-based beam-width value to use during dynamic programming alignment. |
http://localhost:15000/action=AddTask&Type=TranscriptAlign&TxtFile=C:\data\transcript.txt&CtmFile=C:\misc\speechtext.ctm&Out=AlignedTranscript.ctm&MatchType=words
This action compares the transcript transcript.txt
with the speech-to-text output speechtext.ctm
to produce an aligned transcript, AlignedTranscript.ctm
. IDOL Speech Server uses the words
alignment mode.
|