Large scale experiments on correction of confused words

Authors:
Jin Hu Huang;David Powers
Affiliations:
The Flinders University of South Australia;The Flinders University of South Australia
Venue:
ACSC '01 Proceedings of the 24th Australasian conference on Computer science
Year:
2001

Citing 6
Cited 4

A note on undetected typing errors

Communications of the ACM
Context based spelling correction

Information Processing and Management: an International Journal
Artificial intelligence (3rd ed.)

Artificial intelligence (3rd ed.)
Techniques for automatically correcting words in text

ACM Computing Surveys (CSUR)
Automatic Rule Acquisition for Spelling Correction

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Decision lists for lexical ambiguity resolution: application to accent restoration in Spanish and French

ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics

All-word prediction as the ultimate confusable disambiguation

CHSLP '06 Proceedings of the Workshop on Computationally Hard Problems and Joint Inference in Speech and Language Processing
Language models for contextual error detection and correction

CLAGI '09 Proceedings of the EACL 2009 Workshop on Computational Linguistic Aspects of Grammatical Inference
Word processing in spanish using an english keyboard: a study of spelling errors

UI-HCII'07 Proceedings of the 2nd international conference on Usability and internationalization
Minors as miners: modelling and evaluating ontological and linguistic learning

AusDM '08 Proceedings of the 7th Australasian Data Mining Conference - Volume 87

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes a new approach to automatically learn contextual knowledge for spelling and grammar correction --- we aim particularly to deal with cases where the words are all in the dictionary and so it is not obvious that there is an error. Traditional approaches are dictionary based, or use elementary tagging or partial parsing of the sentence to obtain context knowledge. Our approach uses affix information and only the most frequent words to reduce the complexity in terms of training time and running time for context-sensitive spelling correction. We build large scale confused word sets based on keyboard adjacency and apply our new approach to learn the contextual knowledge to detect and correct them. We explore the performance of auto-correction under conditions where significance and probabilty are set by the user.