Improving automatic sentence boundary detection with confusion networks

  • Authors:
  • D. Hillard;M. Ostendorf;A. Stolcke;Y. Liu;E. Shriberg

  • Affiliations:
  • University of Washington, EE;University of Washington, EE;ICSI and SRI International;ICSI;ICSI and SRI International

  • Venue:
  • HLT-NAACL-Short '04 Proceedings of HLT-NAACL 2004: Short Papers
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

We extend existing methods for automatic sentence boundary detection by leveraging multiple recognizer hypotheses in order to provide robustness to speech recognition errors. For each hypothesized word sequence, an HMM is used to estimate the posterior probability of a sentence boundary at each word boundary. The hypotheses are combined using confusion networks to determine the overall most likely events. Experiments show improved detection of sentences for conversational telephone speech, though results are mixed for broadcast news.