Metadata Tag Syntax

IDOL Speech Server allows you to add extra information into a transcript, which it then marks with a metadata tag in the processed transcript, enabling you to easily find it. The extra information does not affect transcription alignment. For example, you might want to mark the point in a lecture transcript where a video was played. You could add the following to the transcript:

...
Let’s
look
at 
this
example
<metadata video = GlobalIssues />
Environmental
issues
...

The aligned transcript that is generated contains the following:

...
1 A 10.1 0.3 Let’s 1.0  
1 A 10.4 0.2 look 1.0  
1 A 10.6 0.3 at 1.0  
1 A 10.9 0.3 this 1.0  
1 A 11.2 0.3 example 1.0  
1 A 11.5 0.0 video = GlobalIssues 1.0 <-- metadata tag
1 A 11.5 0.4 Environmental 1.0  
1 A 11.9 0.3 issues 1.0  

For more information about the aligned transcript format, see Align the Transcript.

Metadata tags must conform to the following syntax to pass through the text normalizer unmodified. The syntax is loosely based on the format of tags used in XML.

Metadata ::= '<metadata' (S Attribute)* S? '/'? '>'
S ::= (#x20 | #x9 | #xD | #xA)+
Attribute ::= Name Eq AttValue
Name ::= NameStartChar (NameChar)*
NameStartChar ::=   ":"
                  | [A-Z]
                  | "_"
                  | [a-z]
                  | [#xC0-#xD6]
                  | [#xD8-#xF6]
                  | [#xF8-#x2FF]
                  | [#x370-#x37D]
                  | [#x37F-#x1FFF]
                  | [#x200C-#x200D]
                  | [#x2070-#x218F]
                  | [#x2C00-#x2FEF]
                  | [#x3001-#xD7FF]
                  | [#xF900-#xFDCF]
                  | [#xFDF0-#xFFFD]
                  | [#x10000-#xEFFFF]
NameChar ::=   NameStartChar
             | "-"
             | "."
             | [0-9]
             | #xB7
             | [#x0300-#x036F]
             | [#x203F-#x2040]
Eq ::= S? '=' S?
AttValue ::=   '"' [^"]* '"'
             | "'" [^']* "'"

_FT_HTML5_bannerTitle.htm