A note on undetected typing errors
Communications of the ACM
Context based spelling correction
Information Processing and Management: an International Journal
Artificial intelligence (3rd ed.)
Artificial intelligence (3rd ed.)
Techniques for automatically correcting words in text
ACM Computing Surveys (CSUR)
Automatic Rule Acquisition for Spelling Correction
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
All-word prediction as the ultimate confusable disambiguation
CHSLP '06 Proceedings of the Workshop on Computationally Hard Problems and Joint Inference in Speech and Language Processing
Language models for contextual error detection and correction
CLAGI '09 Proceedings of the EACL 2009 Workshop on Computational Linguistic Aspects of Grammatical Inference
Word processing in spanish using an english keyboard: a study of spelling errors
UI-HCII'07 Proceedings of the 2nd international conference on Usability and internationalization
Minors as miners: modelling and evaluating ontological and linguistic learning
AusDM '08 Proceedings of the 7th Australasian Data Mining Conference - Volume 87
Hi-index | 0.00 |
This paper describes a new approach to automatically learn contextual knowledge for spelling and grammar correction --- we aim particularly to deal with cases where the words are all in the dictionary and so it is not obvious that there is an error. Traditional approaches are dictionary based, or use elementary tagging or partial parsing of the sentence to obtain context knowledge. Our approach uses affix information and only the most frequent words to reduce the complexity in terms of training time and running time for context-sensitive spelling correction. We build large scale confused word sets based on keyboard adjacency and apply our new approach to learn the contextual knowledge to detect and correct them. We explore the performance of auto-correction under conditions where significance and probabilty are set by the user.