ADJUSTMENTS TO SWB SYSTEMS



Restructured Models
  • Took median duration from forced alignment as new duration

  • Trained models from scratch on the OGI data (Exp_011)

  • Made silence at beginning and ending of test utterances optional

  • Implications
  • Training and testing take twice as long (still faster than phones)

  • Total number of Gaussians in system are twice that of the word-internal phone system