PILOT EXPERIMENT: DATA SPLITTING IN STATE SPACE
baseline system: 800 syllables (8 mixtures per state) + triphones
extraction of trajectories:
clustering of training data for the top 485 syllable models by distance to the original model (distance measure: state acoustic likelihood)
(Korkmazskiy et al. 1997)
splitting of the distance data into two clusters at the mean of the distribution
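A minimal sketch of the mean split described above, not the system's actual code. It assumes each training token of a syllable already has a single scalar score (for example, an averaged state acoustic log-likelihood under the original model). The function name `split_at_mean`, the token ids, and the score values are hypothetical and only illustrate the partitioning step.

```python
from statistics import fmean


def split_at_mean(scores):
    """Partition token ids into two clusters at the mean of their scores.

    scores: dict mapping a token id to one scalar distance score
    (assumed here to be a log-likelihood under the original model).
    Returns (threshold, tokens at or above the mean, tokens below the mean).
    """
    if not scores:
        raise ValueError("need at least one scored token")
    threshold = fmean(scores.values())
    at_or_above = [tok for tok, s in scores.items() if s >= threshold]
    below = [tok for tok, s in scores.items() if s < threshold]
    return threshold, at_or_above, below


# Made-up scores for four tokens of one syllable (illustration only).
example_scores = {"tok_a": -61.0, "tok_b": -58.5, "tok_c": -70.0, "tok_d": -66.5}
threshold, cluster_hi, cluster_lo = split_at_mean(example_scores)
```

Each of the two resulting token lists would then serve as training data for one of the two paths; how ties at the mean are assigned is a choice made here, not something the source specifies.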
new model topologies:
two separate paths, 4 mixtures per state, different numbers of states
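One way to picture such a topology is as a small graph with a shared entry and exit and two parallel state sequences. The sketch below is an assumption-laden illustration: the left-to-right structure with self-loops, the state names, and the path lengths 3 and 5 are hypothetical, since the actual numbers of states per path are not given here.

```python
def two_path_topology(n_states_a, n_states_b):
    """Build allowed transitions for an HMM with two parallel paths.

    Assumes (not stated in the source) that each path is strictly
    left-to-right with self-loops, and that both paths share one
    non-emitting entry node and one non-emitting exit node.
    Returns a dict mapping each state name to its successor states.
    """
    if n_states_a < 1 or n_states_b < 1:
        raise ValueError("each path needs at least one state")
    arcs = {"entry": [], "exit": []}
    for label, n_states in (("a", n_states_a), ("b", n_states_b)):
        names = [f"{label}{i}" for i in range(n_states)]
        arcs["entry"].append(names[0])  # entry branches into each path
        for i, name in enumerate(names):
            nxt = names[i + 1] if i + 1 < n_states else "exit"
            arcs[name] = [name, nxt]  # self-loop, then advance
    return arcs


# Hypothetical path lengths, chosen only to show unequal paths.
topology = two_path_topology(3, 5)
```

In this picture each emitting state would carry its own 4-component mixture, so the two paths together keep the per-state parameter count comparable to the 8-mixture baseline.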