Corpus studies in word prediction

Authors:
Keith Trnka;Kathleen F. McCoy
Affiliations:
University of Delaware;University of Delaware
Venue:
Proceedings of the 9th international ACM SIGACCESS conference on Computers and accessibility
Year:
2007

Citing 10
Cited 7

Two simple prediction algorithms to facilitate text production

ANLC '88 Proceedings of the second conference on Applied natural language processing
Deterministic parsing of syntactic non-fluencies

ACL '83 Proceedings of the 21st annual meeting on Association for Computational Linguistics
Word completion: a first step toward target-text mediated IMT

COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
Dynamic nonlocal language modeling via hierarchical topic-based adaptation

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Semantic knowledge in word completion

Proceedings of the 7th international ACM SIGACCESS conference on Computers and accessibility
Topic modeling in fringe word prediction for AAC

Proceedings of the 11th international conference on Intelligent user interfaces
Improved topic-dependent language modeling using information retrieval techniques

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01
The effects of word prediction on communication rate for AAC

NAACL-Short '07 Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
Exploiting long distance collocational relations in predictive typing

TextEntry '03 Proceedings of the 2003 EACL Workshop on Language Modeling for Text Entry Methods
Testing the efficacy of part-of-speech information in word completion

TextEntry '03 Proceedings of the 2003 EACL Workshop on Language Modeling for Text Entry Methods

Sibylle, An Assistive Communication System Adapting to the Context and Its User

ACM Transactions on Accessible Computing (TACCESS)
Adapting word prediction to subject matter without topic-labeled data

Proceedings of the 10th international ACM SIGACCESS conference on Computers and accessibility
Evaluating word prediction: framing keystroke savings

HLT-Short '08 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers
Adaptive language modeling for word prediction

HLT-SRWS '08 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Student Research Workshop
Building semantic networks to improve word finding in assistive communication tools

Proceedings of the 1st international workshop on Semantic models for adaptive interactive systems
Non-syntactic word prediction for AAC

SLPAT '12 Proceedings of the Third Workshop on Speech and Language Processing for Assistive Technologies
Basic word completion and prediction for hebrew

SPIRE'12 Proceedings of the 19th international conference on String Processing and Information Retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

Word prediction can be used to enhance the communication rate of people with disabilities who use Augmentative and Alternative Communication (AAC) devices. We use statistical methods in a word prediction system, which are trained on a corpus, and then measure the efficacy of the resulting system by calculating the theoretical keystroke savings on some held out data. Ideally training and testing should be done on a large corpus of AAC text covering a variety of topics, but no such corpus exists. We discuss training and testing on a wide variety of corpora meant to approximate text from AAC users. We show that training on a combination of in-domain data with out-of-domain data is often more beneficial than either data set alone and that advanced language modeling such as topic modeling is portable even when applied to very different text.