• Speech recognition typically produces a word-level time-aligned annotation
• Time alignments for other levels of information are also available
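
As a rough illustration of what a word-level time-aligned annotation can look like in code, the sketch below stores each word with a start and end time and looks up the word spoken at a given instant. The `AlignedToken` class, the `token_at` helper, and the sample words and timings are all hypothetical, invented for this example; they do not come from any particular recognizer's output format.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class AlignedToken:
    """One annotated unit (here, a word) with its time span in seconds."""
    label: str
    start: float
    end: float


def token_at(tokens: List[AlignedToken], t: float) -> Optional[AlignedToken]:
    """Return the first token whose [start, end) interval contains t, else None."""
    for tok in tokens:
        if tok.start <= t < tok.end:
            return tok
    return None


# Made-up example data: two words with a short silence between them.
words = [
    AlignedToken("hello", 0.00, 0.42),
    AlignedToken("world", 0.50, 0.95),
]

hit = token_at(words, 0.20)
print(hit.label if hit else "(silence)")
```

The same structure could hold alignments for other levels of information by using a separate list of tokens per level, each with its own labels and time spans.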