A large class of machine-learning problems in natural language require the characterization of linguistic context. Two characteristic properties of such problems are that their feature space is of very high dimensionality, and their target concepts depend on only a small subset of the features in the space. Under such conditions, multiplicative weight-update algorithms such as Winnow have been shown to have exceptionally good theoretical properties. In the work reported here, we present an algorithm combining variants of Winnow and weighted-majority voting, and apply it to a problem in the aforementioned class: context-sensitive spelling correction. This is the task of fixing spelling errors that happen to result in valid words, such as substituting to for too, casual for causal, and so on. We evaluate our algorithm, WinSpell, by comparing it against BaySpell, a statistics-based method representing the state of the art for this task. We find: (1) When run with a full (unpruned) set of features, WinSpell achieves accuracies significantly higher than BaySpell was able to achieve in either the pruned or unpruned condition; (2) When compared with other systems in the literature, WinSpell exhibits the highest performance; (3) While several aspects of WinSpell's architecture contribute to its superiority over BaySpell, the primary factor is that it is able to learn a better linear separator than BaySpell learns; (4) When run on a test set drawn from a different corpus than the training set was drawn from, WinSpell is better able than BaySpell to adapt, using a strategy we will present that combines supervised learning on the training set with unsupervised learning on the (noisy) test set.
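To make the two ingredients named in the abstract concrete, the following is a minimal sketch of a Winnow-style multiplicative update over sparse Boolean features, followed by a weighted-majority vote over several such learners. It is not the WinSpell implementation. The class and function names, the parameter values (theta, alpha, beta, gamma, the initial weight), the rule that weights each learner's vote by gamma raised to its mistake count, and the toy context features are all assumptions chosen for illustration.

```python
from collections import defaultdict


class Winnow:
    """Winnow-style learner over sparse Boolean features (illustrative sketch)."""

    def __init__(self, theta=1.0, alpha=1.5, beta=0.5, init_weight=0.5):
        self.theta = theta    # decision threshold (assumed value)
        self.alpha = alpha    # promotion factor, > 1 (assumed value)
        self.beta = beta      # demotion factor, between 0 and 1 (assumed value)
        self.weights = defaultdict(lambda: init_weight)
        self.mistakes = 0     # running count of training mistakes

    def score(self, active):
        # Only features present in the example contribute to the sum.
        return sum(self.weights[f] for f in active)

    def predict(self, active):
        return self.score(active) >= self.theta

    def update(self, active, label):
        # Mistake-driven: weights change only when the prediction is wrong.
        if self.predict(active) == label:
            return
        self.mistakes += 1
        # Promote active features on a missed positive, demote on a false positive.
        factor = self.alpha if label else self.beta
        for f in active:
            self.weights[f] *= factor


def weighted_majority(learners, active, gamma=0.9):
    """Combine learners by vote; each vote counts gamma ** mistakes (assumed rule)."""
    yes = sum(gamma ** l.mistakes for l in learners if l.predict(active))
    no = sum(gamma ** l.mistakes for l in learners if not l.predict(active))
    return yes >= no


# Made-up toy data: each example is a set of active context features and a label
# (True stands for one word of a confusion pair, False for the other).
# Only the hypothetical feature "next=much" is relevant to the label.
DATA = [
    ({"next=much", "prev=is"}, True),
    ({"prev=is", "next=go"}, False),
    ({"next=much", "prev=was"}, True),
    ({"next=go", "prev=was"}, False),
]

# Several learners that differ only in their demotion factor.
learners = [Winnow(beta=b) for b in (0.5, 0.7, 0.9)]
for _ in range(10):
    for active, label in DATA:
        for learner in learners:
            learner.update(active, label)

predictions = [weighted_majority(learners, active) for active, _ in DATA]
```

Because each update multiplies only the weights of features active in the misclassified example, the relevant feature ends up with a larger weight than the irrelevant ones, which is the behavior that makes this family of algorithms attractive when the target depends on a small subset of a very large feature space.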