Modeling Monosyllabic Words

What is in the training data?

Category
Count/Percentage
Unique Words15,127
Number of Word Tokens659,713
Number of Monosyllabic Words *529
Training tokens covered by top 200 Monosyllabic Words95%
Word tokens covered by top 529 Monosyllabic Words75%
Word tokens covered by top 200 Monosyllabic Words71%

* Dependent on the alignment and lexicon