TranscriptAlign

If a transcript is available for an audio recording, the TranscriptAlign task can place time locations for each word in the transcript. This task is suitable for aligning subtitles to audio or video files.

Parameters

Parameter Description Required
Type The task name. Set to TranscriptAlign. Yes
CtmFile The speech-to-text transcript produced for the audio file. Yes
Lang The language pack to use as a foundation.  
MatchType The alignment mode. Yes
Out The file to write the aligned transcript to. Yes
ScoreBeam The score-based beam-width value to use during dynamic programming alignment.  
TxtFile The transcript file. Yes
WordBeam The word region-based beam-width value to use during dynamic programming alignment.  

Example

http://localhost:13000/action=AddTask&Type=TranscriptAlign&TxtFile=C:\data\transcript.txt&CtmFile=C:\misc\speechtext.ctm&Out=AlignedTranscript.ctm&MatchType=words

This action uses port 13000 to instruct IDOL Speech Server, which is located on the local machine, to compare the transcript transcript.txt with the speech-to-text output speechtext.ctm to produce an aligned transcript, AlignedTranscript.ctm. The action instructs IDOL Speech Server to use the words alignment mode.


_HP_HTML5_bannerTitle.htm