Large scale experiments on correction of confused words

  • Authors:
  • Jin Hu Huang;David Powers

  • Affiliations:
  • The Flinders University of South Australia;The Flinders University of South Australia

  • Venue:
  • ACSC '01 Proceedings of the 24th Australasian conference on Computer science
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes a new approach to automatically learn contextual knowledge for spelling and grammar correction --- we aim particularly to deal with cases where the words are all in the dictionary and so it is not obvious that there is an error. Traditional approaches are dictionary based, or use elementary tagging or partial parsing of the sentence to obtain context knowledge. Our approach uses affix information and only the most frequent words to reduce the complexity in terms of training time and running time for context-sensitive spelling correction. We build large scale confused word sets based on keyboard adjacency and apply our new approach to learn the contextual knowledge to detect and correct them. We explore the performance of auto-correction under conditions where significance and probabilty are set by the user.