We introduce factored language models (FLMs) and generalized parallel backoff (GPB). An FLM represents words as bundles of features (e.g., morphological classes, stems, data-driven clusters) and induces a probability model over sequences of bundles rather than just words. GPB extends standard backoff to general conditional probability tables in which the variables may be of heterogeneous types, no obvious natural (temporal) backoff order exists, and multiple dynamic backoff strategies are allowed. Both methods were implemented during the JHU 2002 workshop as extensions to the SRI language modeling toolkit. This paper provides initial perplexity results on CallHome Arabic and on Penn Treebank Wall Street Journal articles. Notably, FLMs with GPB can produce bigrams with significantly lower perplexity, sometimes lower than that of highly optimized baseline trigrams. These results are highly relevant to multi-pass speech recognition, where bigrams are used to create first-pass bigram lattices or N-best lists.
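To make the idea concrete, here is a minimal sketch of a factored bigram with parallel backoff. It is not the workshop implementation or the SRI toolkit's API. The toy corpus, the `ToyFLM` class, the choice of (word, class) as the two factors, the fixed absolute discount, and the use of the maximum over lower-order nodes as the combination rule are all assumptions made for illustration. The model predicts the current word from the previous word and the previous class. When that full context has not been seen with the word, it backs off in parallel to "previous word only" and "previous class only", then renormalizes so the distribution sums to one.

```python
from collections import Counter

# Hypothetical toy corpus: each token is a bundle of factors (word, class).
CORPUS = [
    [("the", "DET"), ("dog", "N"), ("runs", "V")],
    [("the", "DET"), ("cat", "N"), ("sleeps", "V")],
    [("a", "DET"), ("dog", "N"), ("sleeps", "V")],
    [("a", "DET"), ("cat", "N"), ("runs", "V")],
    [("the", "DET"), ("bird", "N"), ("sings", "V")],
]
DISCOUNT = 0.5  # absolute discount; an arbitrary illustrative value
FULL = (0, 1)   # parent indices: 0 = previous word, 1 = previous class


class ToyFLM:
    """P(word | previous word, previous class) with parallel backoff."""

    def __init__(self, corpus, discount=DISCOUNT):
        self.discount = discount
        self.vocab = sorted({w for sent in corpus for w, _ in sent})
        self.counts = Counter()  # keyed by (node, context, word)
        self.totals = Counter()  # keyed by (node, context)
        for sent in corpus:
            prev = ("<s>", "<s>")
            for word, cls in sent:
                # Collect counts for every node of the backoff graph.
                for node in [(0, 1), (0,), (1,), ()]:
                    ctx = tuple(prev[i] for i in node)
                    self.counts[node, ctx, word] += 1
                    self.totals[node, ctx] += 1
                prev = (word, cls)

    def backoff_score(self, word, prev, node):
        # Parallel backoff: drop each parent in turn, then combine the
        # lower-order estimates (here with max, an assumed choice).
        children = [tuple(p for p in node if p != drop) for drop in node]
        return max(self.prob(word, prev, child) for child in children)

    def prob(self, word, prev, node=FULL):
        ctx = tuple(prev[i] for i in node)
        total = self.totals[node, ctx]
        count = self.counts[node, ctx, word]
        if not node:  # unigram node: plain relative frequency
            return count / total
        if count > 0:  # seen event: discounted estimate
            return (count - self.discount) / total
        # Unseen event: spread the discounted mass over unseen words,
        # in proportion to their combined backoff scores.
        seen = {w for w in self.vocab if self.counts[node, ctx, w] > 0}
        left = self.discount * len(seen) / total if total else 1.0
        norm = sum(self.backoff_score(w, prev, node)
                   for w in self.vocab if w not in seen)
        if norm == 0:
            return 0.0
        return left * self.backoff_score(word, prev, node) / norm


model = ToyFLM(CORPUS)
print(model.prob("runs", ("bird", "N")))
```

In this toy data "runs" never follows "bird", so the word-only path can only fall back to the unigram, while the class-only path has seen "runs" after other nouns. Taking the maximum lets the class path dominate for this event, which is the kind of choice a fixed temporal backoff order cannot make.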