•
Typical LVCSR systems have about 10M free parameters, which makes
training a challenge.
•
Large
speech databases are required (several hundred hours of speech).
•
Tying, smoothing, and interpolation are required.