Speaker Normalization
- create a common feature space where the interspeaker variance is minimized
- VTLN - dynamic frequency warping
- trying to make all speakers appear equivalent so a common model set performs well on all speakers
Speaker Adaptation
- the recognition system is adapted to maximize performance for a single speaker
- can use either batch or incremental adaptation
- MLLR, speaker clustering