SEGMENTATION CONVENTIONS
Utterances consist of speech, typically padded with 0.5 seconds
of silence
Utterances typically do not exceed 15 seconds in duration
Breakpoints are inserted at natural points in the utterance,
such as a change in topic or the end of a sentence or paragraph
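
The conventions above can be sketched as a small check in Python. This is
only an illustration, not the tooling used to produce the segmentation: the
names Utterance, pad_speech and over_long are hypothetical, and the sketch
assumes the 0.5-second padding is applied on both sides of the speech
region, which the conventions do not state explicitly.

```python
from dataclasses import dataclass

PAD_SECS = 0.5       # typical silence padding, per the conventions above
MAX_UTT_SECS = 15.0  # typical upper bound on utterance duration


@dataclass
class Utterance:
    """Hypothetical container for one segmented utterance (times in seconds)."""
    start: float  # includes leading silence padding
    end: float    # includes trailing silence padding

    @property
    def duration(self) -> float:
        return self.end - self.start


def pad_speech(speech_start, speech_end, audio_len, pad=PAD_SECS):
    """Extend a speech region by `pad` seconds on each side (an assumption),
    clamped so the utterance never runs past the ends of the audio file."""
    return Utterance(max(0.0, speech_start - pad),
                     min(audio_len, speech_end + pad))


def over_long(utterances, limit=MAX_UTT_SECS):
    """Return utterances longer than the typical limit. These are candidates
    for an extra breakpoint at a natural point, chosen by a human annotator."""
    return [u for u in utterances if u.duration > limit]
```

Because both figures are described as typical rather than strict, over_long
only flags segments for review. Choosing the breakpoint itself (a change in
topic, the end of a sentence or paragraph) is left to the annotator.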