A Cache-Based Natural Language Model for Speech Recognition

Authors:
R. Kuhn;R. De Mori
Affiliations:
-;-
Venue:
IEEE Transactions on Pattern Analysis and Machine Intelligence
Year:
1990

Citing 3
Cited 76

Operating system concepts (2nd ed.)

Operating system concepts (2nd ed.)
Word frequency and text type: some observations based on the LOB corpus of British English texts

Computers and the Humanities
Natural Language Modeling for Phoneme-to-Text Transcription

IEEE Transactions on Pattern Analysis and Machine Intelligence

Some results on stochastic language modeling

HLT '91 Proceedings of the workshop on Speech and Natural Language
Studies in part of speech labelling

HLT '91 Proceedings of the workshop on Speech and Natural Language
Towards understanding text with a very large vocabulary

HLT '90 Proceedings of the workshop on Speech and Natural Language
Computation of Probabilities for an Island-Driven Parser

IEEE Transactions on Pattern Analysis and Machine Intelligence
Improving statistical language model performance with automatically generated word hierarchies

Computational Linguistics
A Review of Statistical Language Processing Techniques

Artificial Intelligence Review
Statistical Models for Text Segmentation

Machine Learning - Special issue on natural language learning
Document centered approach to text normalization

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Target-Text Mediated Interactive Machine Translation

Machine Translation
Unit Completion for a Computer-aided Translation Typing System

Machine Translation
Corrections to "A Cache-Based Language Model for Speech Recognition"

IEEE Transactions on Pattern Analysis and Machine Intelligence
On the Estimation of 'Small' Probabilities by Leaving-One-Out

IEEE Transactions on Pattern Analysis and Machine Intelligence
Testing the correlation of word error rate and perplexity

Speech Communication
Periods, capitalized words, etc.

Computational Linguistics
Variable Length Language Model for Chinese Character Recognition

ICMI '00 Proceedings of the Third International Conference on Advances in Multimodal Interfaces
Segmenting Conversations by Topic, Initiative, and Style

Information Retrieval Techniques for Speech Applications [this book is based on the workshop “Information Retrieval Techniques for Speech Applications”, held as part of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in New Orleans, USA, in September 2001].
Pattern matching for design concept localization

WCRE '95 Proceedings of the Second Working Conference on Reverse Engineering
Task adaptation in stochastic language model for Chinese homophone disambiguation

ACM Transactions on Asian Language Information Processing (TALIP)
Dialogue act modeling for automatic tagging and recognition of conversational speech

Computational Linguistics
Coping with ambiguity and unknown words through probabilistic models

Computational Linguistics - Special issue on using large corpora: II
Real-time automatic insertion of accents in French text

Natural Language Engineering
Topic-based mixture language modelling

Natural Language Engineering
A model of lexical attraction and repulsion

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Trigger-pair predictors in parsing and tagging

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
A dynamic language model based on individual word domains

COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 2
Dynamic nonlocal language modeling via hierarchical topic-based adaptation

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Chinese named entity identification using class-based language model

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Entropy rate constancy in text

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Improvements in stochastic language modeling

HLT '91 Proceedings of the workshop on Speech and Natural Language
Recent topics in speech recognition research at NTT laboratories

HLT '91 Proceedings of the workshop on Speech and Natural Language
Language modeling with sentence-level mixtures

HLT '94 Proceedings of the workshop on Human Language Technology
TransType: a computer-aided translation typing system

NAACL-ANLP-EMTS '00 Proceedings of the 2000 NAACL-ANLP Workshop on Embedded machine translation systems - Volume 5
Nonlocal language modeling based on context co-occurrence vectors

EMNLP '00 Proceedings of the 2000 Joint SIGDAT conference on Empirical methods in natural language processing and very large corpora: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 13
Understanding without formality: augmenting speech recognition to understand informal verbal commands

Proceedings of the 43rd annual Southeast regional conference - Volume 1
Modeling local coherence: an entity-based approach

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Integrating syntactic priming into an incremental probabilistic parser, with an application to psycholinguistic modeling

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Parallelism in coordination as an instance of syntactic priming: evidence from corpus-based modeling

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Continuous space language models

Computer Speech and Language
Word-based predictive text entry using adaptive language models

Natural Language Engineering
Combining statistical data analysis techniques to extract topical keyword classes from corpora

Intelligent Data Analysis
An adaptive approach to named entity extraction for meeting applications

HLT '02 Proceedings of the second international conference on Human Language Technology Research
Modeling local coherence: An entity-based approach

Computational Linguistics
Sibylle, An Assistive Communication System Adapting to the Context and Its User

ACM Transactions on Accessible Computing (TACCESS)
Cache-based Statistical Language Models of English and Highly Inflected Lithuanian

Informatica
Context-Aware Users' Preference Models by Integrating Real and Supposed Situation Data

IEICE - Transactions on Information and Systems
Style & topic language model adaptation using HMM-LDA

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
TransType: a computer-aided translation typing system

EmbedMT '00 ANLP-NAACL 2000 Workshop: Embedded Machine Translation Systems
Implicitly supervised language model adaptation for meeting transcription

NAACL-Short '07 Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
Mixture-model adaptation for SMT

StatMT '07 Proceedings of the Second Workshop on Statistical Machine Translation
POST: using probabilities in language processing

IJCAI'91 Proceedings of the 12th international joint conference on Artificial intelligence - Volume 2
Cache-based language model adaptation using visual attention for ASR in meeting scenarios

Proceedings of the 2009 international conference on Multimodal interfaces
Web-based topic language modeling for audio indexing

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Language models based on semantic composition

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 1
An artificial immune network approach for pinyin-to- character conversion

VECIMS'09 Proceedings of the 2009 IEEE international conference on Virtual Environments, Human-Computer Interfaces and Measurement Systems
Topic-Dependent Language Model with Voting on Noun History

ACM Transactions on Asian Language Information Processing (TALIP)
To cache or not to cache?: experiments with adaptive models in statistical machine translation

WMT '10 Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR
Context adaptation in statistical machine translation using models with exponentially decaying cache

DANLP 2010 Proceedings of the 2010 Workshop on Domain Adaptation for Natural Language Processing
Language pyramid and multi-scale text analysis

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Topic tracking language model for speech recognition

Computer Speech and Language
Bayesian adaptation for statistical machine translation

SSPR&SPR'10 Proceedings of the 2010 joint IAPR international conference on Structural, syntactic, and statistical pattern recognition
Mining monolingual and bilingual corpora

Intelligent Data Analysis
Task adaptation in stochastic language models for continuous speech recognition

ICASSP'92 Proceedings of the 1992 IEEE international conference on Acoustics, speech and signal processing - Volume 1
Log-linear weight optimisation via Bayesian adaptation in statistical machine translation

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
On the dynamic adaptation of stochastic language models

ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II
TransSearch: from a bilingual concordancer to a translation finder

Machine Translation
Integrating imperfect transcripts into speech recognition systems for building high-quality corpora

Computer Speech and Language
Ending-based strategies for part-of-speech tagging

UAI'94 Proceedings of the Tenth international conference on Uncertainty in artificial intelligence
A three level cache-based adaptive chinese language model

IJCNLP'04 Proceedings of the First international joint conference on Natural Language Processing
Cache-based document-level statistical machine translation

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
The latent words language model

Computer Speech and Language
Reconstructing corrupt DEFLATEd files

Digital Investigation: The International Journal of Digital Forensics & Incident Response
On using context for automatic correction of non-word misspellings in student essays

Proceedings of the Seventh Workshop on Building Educational Applications Using NLP
Measuring the influence of long range dependencies with neural network language models

WLM '12 Proceedings of the NAACL-HLT 2012 Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT
On the dynamic adaptation of language models based on dialogue information

Expert Systems with Applications: An International Journal
Influence relation estimation based on lexical entrainment in conversation

Speech Communication
Leveraging relevance cues for language modeling in speech recognition

Information Processing and Management: an International Journal

Quantified Score

Hi-index	0.15

Visualization

Abstract

Speech-recognition systems must often decide between competing ways of breaking up the acoustic input into strings of words. Since the possible strings may be acoustically similar, a language model is required; given a word string, the model returns its linguistic probability. Several Markov language models are discussed. A novel kind of language model which reflects short-term patterns of word use by means of a cache component (analogous to cache memory in hardware terminology) is presented. The model also contains a 3g-gram component of the traditional type. The combined model and a pure 3g-gram model were tested on samples drawn from the Lancaster-Oslo/Bergen (LOB) corpus of English text. The relative performance of the two models is examined, and suggestions for the future improvements are made.