SEGMENTATION AND CLASSIFICATION
Need to classify data into
- Speech
- Music
- Speech with Music
Segmentation Strategies
- BBN --- dual gender phone decoding, chop at pauses and gender turns
- CU-HTK --- frequency-based classification, gender-dependent phone decoding
- CU-Con --- syllable boundary information
- CMU --- GMM classifier, entropy-based segmentation at silences
- Dragon --- amplitude-based silence detection, phoneme recognition
- IBM --- GMM classifier, turn detection using BIC
- LIMSI --- GMM classifier, bottom-up agglomerative clustering
- OGI --- used CMU segmentations
- Philips --- frequency-based classification, gender-dependent phone decoding
- SRI --- context-dependent PTM classifier, bottom-up agglomerative clustering