• Most phone-based systems have 3 state HMM models regardless of the task

  • Syllable models are typically chosen to be proportional to average duration for that task

  • The initial model choice is empirical

  • May have impact on the ability of the trained model to fit the training data