Analysing performance in a word prediction system with multiple prediction methods

Authors:
Pertti Alvar Väyrynen;Kai Noponen;Tapio Seppänen
Affiliations:
Oulun yliopisto Sähkö-ja tietotekniikan osasto, Tietokonetekniikan laboratorio, PL 4500, 90014 Oulun yliopisto, Finland;Oulun yliopisto Sähkö-ja tietotekniikan osasto, Tietokonetekniikan laboratorio, PL 4500, 90014 Oulun yliopisto, Finland;Oulun yliopisto Sähkö-ja tietotekniikan osasto, Tietokonetekniikan laboratorio, PL 4500, 90014 Oulun yliopisto, Finland
Venue:
Computer Speech and Language
Year:
2007

Citing 7
Cited 0

Predictive interfaces: what will they think of next?

Extra-ordinary human-computer interaction
Handbook of Natural Language Processing

Handbook of Natural Language Processing
Abbreviated text input

Proceedings of the 8th international conference on Intelligent user interfaces
FASTY - A Multi-lingual Approach to Text Prediction

ICCHP '02 Proceedings of the 8th International Conference on Computers Helping People with Special Needs
User-friendly text prediction for translators

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
The wild thing!

ACLdemo '05 Proceedings of the ACL 2005 on Interactive poster and demonstration sessions
Exploiting long distance collocational relations in predictive typing

TextEntry '03 Proceedings of the 2003 EACL Workshop on Language Modeling for Text Entry Methods

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this article, we investigate the performance of a hybrid prediction system with a phrase prediction utility in English word prediction from two viewpoints. From the application user's point of view, measures of effort savings are important in word prediction. Global performance measures such as the average percentage of keystroke or character savings, however, hide rather than display the details of the functioning of the prediction system as a whole. In the present study, we analysed in detail the performance of a prediction system with a phrase prediction utility along with single word prediction. Our preliminary results with a corpus of 383 lexical bundles show that, from a technological viewpoint, the following three parameters affect the practical utility of the phrase prediction method in a hybrid prediction system: (1) cost of selecting an appropriate prediction mode for single word prediction and phrase prediction; (2) token frequency of phrases in the text predicted, and (3) coverage of the phrasal lexicon. We found that all three affect the phrase prediction performance in different proportions. When the percent of ambiguous search keys finding both phrases and single words is 20%, phrase frequency 35%, and coverage of the phrasal lexicon 98%, the character savings percentage for the whole text will be improved by 6% points under optimal conditions. The system is practically useful as long as an appropriate prediction mode can be selected automatically or the cost of disambiguation of a prediction mode is not too high.