Multi-speaker language modeling

  • Authors:
  • Gang Ji, Jeff Bilmes

  • Affiliations:
  • University of Washington, Seattle, WA (both authors)

  • Venue:
  • HLT-NAACL-Short '04: Proceedings of HLT-NAACL 2004: Short Papers
  • Year:
  • 2004

Abstract

In conventional language modeling, the words of only one speaker at a time are represented, even for conversational tasks such as meetings and telephone calls. In a conversational or meeting setting, however, speakers can significantly influence each other. To recover this unmodeled inter-speaker information, we introduce an approach to conversational language modeling that considers words from other speakers when predicting words from the current one. By augmenting a normal trigram context, our new multi-speaker language model (MSLM) improves perplexity on both the Switchboard and ICSI Meeting Recorder corpora. Using an MSLM and a conditional-mutual-information-based word clustering algorithm, we achieve an 8.9% perplexity reduction on Switchboard and a 12.2% reduction on the ICSI Meeting Recorder data.
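
As a rough illustration of the kind of conditioning the abstract describes, the sketch below interpolates a standard trigram estimate with an estimate conditioned on a recent word from the other speaker. The toy data, add-one smoothing, interpolation weight, and function names (train, prob) are hypothetical choices for illustration, not the authors' actual model, smoothing, or estimation procedure, and the conditional-mutual-information word clustering from the paper is omitted.

# Minimal sketch of the idea behind a multi-speaker language model (MSLM):
# the current speaker's next word is conditioned not only on the usual
# trigram history but also on a recent word from the other speaker.
# The toy data, counts, and linear interpolation below are illustrative
# assumptions, not the authors' actual model or estimation procedure.
from collections import defaultdict

def train(dialog):
    """dialog: list of (speaker_id, [words]) turns in conversation order."""
    tri = defaultdict(lambda: defaultdict(int))    # (w1, w2) -> counts of w3
    cross = defaultdict(lambda: defaultdict(int))  # other-speaker word -> counts of w3
    last_word = {}                                 # last word produced by each speaker
    for speaker, words in dialog:
        history = ["<s>", "<s>"]
        # most recent word from any *other* speaker (a simplifying assumption)
        other = next((w for s, w in last_word.items() if s != speaker), "<none>")
        for w in words:
            tri[tuple(history)][w] += 1
            cross[other][w] += 1
            history = [history[1], w]
        if words:
            last_word[speaker] = words[-1]
    return tri, cross

def prob(w, history, other, tri, cross, lam=0.7, vocab=10000):
    """Interpolate an add-one-smoothed trigram estimate with a cross-speaker estimate."""
    t, c = tri[tuple(history)], cross[other]
    p_tri = (t[w] + 1) / (sum(t.values()) + vocab)
    p_cross = (c[w] + 1) / (sum(c.values()) + vocab)
    return lam * p_tri + (1 - lam) * p_cross

if __name__ == "__main__":
    dialog = [("A", ["hello", "there"]), ("B", ["hi", "how", "are", "you"])]
    tri, cross = train(dialog)
    # probability of "how" given trigram history ("<s>", "hi") and the
    # other speaker's most recent word "there"
    print(prob("how", ["<s>", "hi"], "there", tri, cross))

The point of the sketch is only the extra conditioning variable: the cross-speaker word acts as an additional context feature alongside the trigram history, which is the general mechanism by which an MSLM can lower perplexity on conversational data.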