Factored language models and generalized parallel backoff

Authors:
Jeff A. Bilmes;Katrin Kirchhoff
Affiliations:
University of Washington;University of Washington
Venue:
NAACL-Short '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume of the Proceedings of HLT-NAACL 2003--short papers - Volume 2
Year:
2003

Citing 1
Cited 55

Statistical methods for speech recognition

Statistical methods for speech recognition

Automatic learning of language model structure

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Part-of-speech tagging using virtual evidence and negative training

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Backoff model training using partially observed data: application to dialog act tagging

HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Multimodal generation in the COMIC dialogue system

ACLdemo '05 Proceedings of the ACL 2005 on Interactive poster and demonstration sessions
Detecting word substitutions: PMI vs. HMM

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Morph-based speech recognition and modeling of out-of-vocabulary words across languages

ACM Transactions on Speech and Language Processing (TSLP)
Modeling Topic and Role Information in Meetings Using the Hierarchical Dirichlet Process

MLMI '08 Proceedings of the 5th international workshop on Machine Learning for Multimodal Interaction
Syllable Based Language Model for Large Vocabulary Continuous Speech Recognition of Polish

TSD '08 Proceedings of the 11th international conference on Text, Speech and Dialogue
Factored sequence kernels

Neurocomputing
Moses: open source toolkit for statistical machine translation

ACL '07 Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions
Multi-speaker language modeling

HLT-NAACL-Short '04 Proceedings of HLT-NAACL 2004: Short Papers
Factored neural language models

NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Learning Bayesian networks for semantic frame composition in a spoken dialog system

NAACL-Short '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
Design of the moses decoder for statistical machine translation

SETQA-NLP '08 Software Engineering, Testing, and Quality Assurance for Natural Language Processing
Local search for balanced submodular clusterings

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Designing an extensible API for integrating language modeling and realization

Software '05 Proceedings of the Workshop on Software
CCG supertags in factored statistical machine translation

StatMT '07 Proceedings of the Second Workshop on Statistical Machine Translation
English-to-Czech factored machine translation

StatMT '07 Proceedings of the Second Workshop on Statistical Machine Translation
Jointly labeling multiple sequences: a factorial HMM approach

ACLstudent '05 Proceedings of the ACL Student Research Workshop
Linguistically naïve != language independent: why NLP needs linguistic typology

ILCL '09 Proceedings of the EACL 2009 Workshop on the Interaction between Linguistics and Computational Linguistics: Virtuous, Vicious or Vacuous?
Improved language modeling for statistical machine translation

ParaText '05 Proceedings of the ACL Workshop on Building and Using Parallel Texts
Perceptron reranking for CCG realization

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 1
A joint language model with fine-grain syntactic tags

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
Topic-Dependent Language Model with Voting on Noun History

ACM Transactions on Asian Language Information Processing (TALIP)
Using prosodic features in language models for meetings

MLMI'07 Proceedings of the 4th international conference on Machine learning for multimodal interaction
Overview of Morpho challenge 2008

CLEF'08 Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access
Morpho challenge evaluation by information retrieval experiments

CLEF'08 Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access
A hybrid morphologically decomposed factored language models for Arabic LVCSR

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Phrase-based statistical language generation using graphical models and active learning

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Perplexity of n-gram and dependency language models

TSD'10 Proceedings of the 13th international conference on Text, speech and dialogue
Overview and results of Morpho challenge 2009

CLEF'09 Proceedings of the 10th cross-language evaluation forum conference on Multilingual information access evaluation: text retrieval experiments
Factored bilingual n-gram language models for statistical machine translation

Machine Translation
Hierarchical Bayesian language models for conversational speech recognition

IEEE Transactions on Audio, Speech, and Language Processing
Improved text generation using n-gram statistics

IBERAMIA'10 Proceedings of the 12th Ibero-American conference on Advances in artificial intelligence
Generating tailored, comparative descriptions with contextually appropriate intonation

Computational Linguistics
Highly-inflected language generation using factored language models

CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part I
Integrating history-length interpolation and classes in language modeling

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Confidence-weighted learning of factored discriminative language models

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Improved modeling of out-of-vocabulary words using morphological classes

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Syntactic discriminative language model rerankers for statistical machine translation

Machine Translation
Measures to detect word substitution in intercepted communication

ISI'06 Proceedings of the 4th IEEE international conference on Intelligence and Security Informatics
Wider context by using bilingual language models in machine translation

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Dependency-based n-gram models for general purpose sentence realisation

Natural Language Engineering
Syntactic decision tree LMs: random selection or intelligent design?

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Statistical machine translation with local language models

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Orthographic and morphological processing for English---Arabic statistical machine translation

Machine Translation
Multistream recognition of dialogue acts in meetings

MLMI'06 Proceedings of the Third international conference on Machine Learning for Multimodal Interaction
Integrating Generative and Discriminative Character-Based Models for Chinese Word Segmentation

ACM Transactions on Asian Language Information Processing (TALIP)
A scalable distributed syntactic, semantic, and lexical language model

Computational Linguistics
Hierarchical Bayesian language modelling for the linguistically informed

EACL '12 Proceedings of the Student Research Workshop at the 13th Conference of the European Chapter of the Association for Computational Linguistics
Continuous space translation models with neural networks

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
A comparative investigation of morphological language modeling for the languages of the European union

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Multiple model text normalization for the polish language

ISMIS'12 Proceedings of the 20th international conference on Foundations of Intelligent Systems
Parsing models for identifying multiword expressions

Computational Linguistics
Statistical machine translation enhancements through linguistic levels: A survey

ACM Computing Surveys (CSUR)

Quantified Score

Hi-index	0.00

Visualization

Abstract

We introduce factored language models (FLMs) and generalized parallel backoff (GPB). An FLM represents words as bundles of features (e.g., morphological classes, stems, data-driven clusters, etc.), and induces a probability model covering sequences of bundles rather than just words. GPB extends standard backoff to general conditional probability tables where variables might be heterogeneous types, where no obvious natural (temporal) backoff order exists, and where multiple dynamic backoff strategies are allowed. These methodologies were implemented during the JHU 2002 workshop as extensions to the SRI language modeling toolkit. This paper provides initial perplexity results on both CallHome Arabic and on Penn Treebank Wall Street Journal articles. Significantly, FLMs with GPB can produce bigrams with significantly lower perplexity, sometimes lower than highly-optimized baseline trigrams. In a multi-pass speech recognition context, where bigrams are used to create first-pass bigram lattices or N-best lists, these results are highly relevant.