THE BROADCAST NEWS RECOGNITION TASK

Television and radio news data and speeches

Various focus conditions

  • F0 - Baseline
  • F1 - Spontaneous
  • F2 - Telephone
  • F3 - Degraded
  • F4 - Music
  • F5 - Non-native
  • FX - Other

Test data selection

  • To minimize year-to-year variability
  • To use recent broadcasts for keeping LMs current
  • Randomized excerpts selection
  • To represent the training set (not to maximize focus condition coverage)
  • 166 minutes of news, 144 variable-length excerpts
  • 14 one-minute speech excerpts
  • Audrey Le got commended for her work on test data selection