- initial model is the three states of the 's' phone model trained on
Alphadigit data
- adaptation data is extracted from forced alignments of 21 utterances by
one speaker. (267 adaptation observations for state 1, 485 for state
2 and 122 for state 3)
- as an example, we consider a 2D model with the first two cepstral
coefficients