A Winnow-Based Approach to Context-Sensitive Spelling Correction

  • Authors:
  • Andrew R. Golding; Dan Roth

  • Affiliations:
  • MERL—A Mitsubishi Electric Research Laboratory, 201 Broadway, Cambridge, MA 02139. golding@merl.com; Department of Computer Science, University of Illinois—Urbana/Champaign, 1304 W. Springfield Avenue, Urbana, IL 61801. danr@cs.uiuc.edu

  • Venue:
  • Machine Learning - Special issue on natural language learning
  • Year:
  • 1999

Abstract

A large class of machine-learning problems in natural language require the characterization of linguistic context. Two characteristic properties of such problems are that their feature space is of very high dimensionality, and their target concepts depend on only a small subset of the features in the space. Under such conditions, multiplicative weight-update algorithms such as Winnow have been shown to have exceptionally good theoretical properties. In the work reported here, we present an algorithm combining variants of Winnow and weighted-majority voting, and apply it to a problem in the aforementioned class: context-sensitive spelling correction. This is the task of fixing spelling errors that happen to result in valid words, such as substituting to for too, casual for causal, and so on. We evaluate our algorithm, WinSpell, by comparing it against BaySpell, a statistics-based method representing the state of the art for this task. We find: (1) When run with a full (unpruned) set of features, WinSpell achieves accuracies significantly higher than BaySpell was able to achieve in either the pruned or unpruned condition; (2) When compared with other systems in the literature, WinSpell exhibits the highest performance; (3) While several aspects of WinSpell's architecture contribute to its superiority over BaySpell, the primary factor is that it is able to learn a better linear separator than BaySpell learns; (4) When run on a test set drawn from a different corpus than the training set was drawn from, WinSpell is better able than BaySpell to adapt, using a strategy we will present that combines supervised learning on the training set with unsupervised learning on the (noisy) test set.
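
The two ingredients the abstract names, a multiplicative weight-update learner (Winnow) and a weighted-majority combination of variants, can be illustrated with a short sketch. The Python below is a minimal illustration under stated assumptions, not the paper's actual WinSpell implementation: the class names, parameter values (promotion and demotion factors, threshold, vote penalty), and the toy context features are all hypothetical. The key idea is that updates are mistake-driven and multiplicative: on a false negative, the weights of active features are multiplied by a factor greater than 1; on a false positive, by a factor less than 1.

```python
# Minimal sketch of Winnow-style multiplicative updates plus
# weighted-majority voting over several Winnow variants.
# NOT the paper's WinSpell architecture; names and parameters
# here are illustrative assumptions.

class Winnow:
    """Mistake-driven linear learner over sparse Boolean features."""

    def __init__(self, threshold=1.0, promotion=1.5, demotion=0.5):
        self.weights = {}            # feature -> weight, lazily set to 1.0
        self.threshold = threshold   # predict positive when score >= threshold
        self.promotion = promotion   # >1: multiply active weights on false negatives
        self.demotion = demotion     # <1: multiply active weights on false positives

    def predict(self, active):
        return sum(self.weights.setdefault(f, 1.0) for f in active) >= self.threshold

    def update(self, active, label):
        # Only the weights of features active in this example change,
        # and only when the learner makes a mistake.
        if self.predict(active) == label:
            return
        factor = self.promotion if label else self.demotion
        for f in active:
            self.weights[f] *= factor


class WeightedMajority:
    """Combines several Winnow variants; demotes the vote of any expert that errs."""

    def __init__(self, experts, beta=0.9):
        self.experts = experts
        self.votes = [1.0] * len(experts)
        self.beta = beta             # vote-weight penalty for a wrong expert

    def predict(self, active):
        pro = sum(v for e, v in zip(self.experts, self.votes) if e.predict(active))
        return pro >= sum(self.votes) / 2

    def update(self, active, label):
        for i, expert in enumerate(self.experts):
            if expert.predict(active) != label:
                self.votes[i] *= self.beta
            expert.update(active, label)


# Hypothetical usage for the confusion set {to, too}: features encode
# words adjacent to the ambiguous token; label True means 'to' is correct.
ensemble = WeightedMajority([Winnow(promotion=p) for p in (1.25, 1.5, 2.0)])
for features, label in [({"left:want", "right:go"}, True),     # "want to go"
                        ({"left:much", "right:late"}, False)]: # "much too late"
    ensemble.update(features, label)
```

Because only the weights of active features are ever touched, a multiplicative learner of this kind stays cheap even when the feature space is very large, and its mistake bound grows only logarithmically in the number of features when few of them are relevant, which is exactly the regime the abstract describes.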