Improving automatic sentence boundary detection with confusion networks

Authors:
D. Hillard;M. Ostendorf;A. Stolcke;Y. Liu;E. Shriberg
Affiliations:
University of Washington, EE;University of Washington, EE;ICSI and SRI International;ICSI;ICSI and SRI International
Venue:
HLT-NAACL-Short '04 Proceedings of HLT-NAACL 2004: Short Papers
Year:
2004

Citing 1
Cited 1

Prosody-based automatic segmentation of speech into sentences and topics

Speech Communication - Special issue on accessing information in spoken audio

Cascaded model adaptation for dialog act segmentation and tagging

Computer Speech and Language

Quantified Score

Hi-index	0.00

Visualization

Abstract

We extend existing methods for automatic sentence boundary detection by leveraging multiple recognizer hypotheses in order to provide robustness to speech recognition errors. For each hypothesized word sequence, an HMM is used to estimate the posterior probability of a sentence boundary at each word boundary. The hypotheses are combined using confusion networks to determine the overall most likely events. Experiments show improved detection of sentences for conversational telephone speech, though results are mixed for broadcast news.