• system: 200 monosyllabic word models + 632 syllable models + triphones

  • all words and 260 most frequent syllables were split at geometric mean of 25th and 75th percentiles of duration histogram

  • new models: seeded from original models, re-defined number of states
    (1/2 * duration at 25% / 75%)

  • four passes of re-estimation