Part of speech tagging using a network of linear separators

Authors:
Dan Roth;Dmitry Zelenko
Affiliations:
University of Illinois at Urbana-Champaign, Urbana, IL;University of Illinois at Urbana-Champaign, Urbana, IL
Venue:
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Year:
1998

Citing 6
Cited 19

The weighted majority algorithm

Information and Computation
Transformation-based error-driven learning and natural language processing: a case study in part-of-speech tagging

Computational Linguistics
Learning to resolve natural language ambiguities: a unified approach

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Learning Quickly When Irrelevant Attributes Abound: A New Linear-Threshold Algorithm

Machine Learning
Distributional part-of-speech tagging

EACL '95 Proceedings of the seventh conference on European chapter of the Association for Computational Linguistics
Part-of-speech tagging with neural networks

COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1

Learning to resolve natural language ambiguities: a unified approach

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
A Winnow-Based Approach to Context-Sensitive Spelling Correction

Machine Learning - Special issue on natural language learning
Coherent Concepts, Robust Learning

SOFSEM '99 Proceedings of the 26th Conference on Current Trends in Theory and Practice of Informatics on Theory and Practice of Informatics
Constraint Classification: A New Approach to Multiclass Classification

ALT '02 Proceedings of the 13th International Conference on Algorithmic Learning Theory
Classification Approach to Word Selection in Machine Translation

AMTA '02 Proceedings of the 5th Conference of the Association for Machine Translation in the Americas on Machine Translation: From Research to Real Users
Combining trigram and automatic weight distribution in Chinese spelling error correction

Journal of Computer Science and Technology
A classification approach to word prediction

NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
A second-order Hidden Markov Model for part-of-speech tagging

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Bootstrapping bilingual data using consensus translation for a multilingual instant messaging system

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Have things changed now?: an empirical study of bug characteristics in modern open source software

Proceedings of the 1st workshop on Architectural and system support for improving software dependability
Improving Text Summarization Using Noun Retrieval Techniques

KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
Using a Bigram Event Model to Predict Causal Potential

CICLing '09 Proceedings of the 10th International Conference on Computational Linguistics and Intelligent Text Processing
Design challenges and misconceptions in named entity recognition

CoNLL '09 Proceedings of the Thirteenth Conference on Computational Natural Language Learning
Learning in natural language

IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
Relational learning for NLP using linear threshold elements

IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
Noun retrieval effect on text summarization and delivery of personalized news articles to the user's desktop

Data & Knowledge Engineering
Machine transliteration survey

ACM Computing Surveys (CSUR)
Resolution of data sparseness in named entity recognition using hierarchical features and feature relaxation principle

CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
A robust shallow temporal reasoning system

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Demonstration Session

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present an architecture and an on-line learning algorithm and apply it to the problem of part-of-speech tagging. The architecture presented, SNOW, is a network of linear separators in the feature space, utilizing the Winnow update algorithm.Multiplicative weight-update algorithms such as Winnow have been shown to have exceptionally good behavior when applied to very high dimensional problems, and especially when the target concepts depend on only a small subset of the features in the feature space. In this paper we describe an architecture that utilizes this mistake-driven algorithm for multi-class prediction-selecting the part of speech of a word. The experimental analysis presented here provides more evidence to that these algorithms are suitable for natural language problems.The algorithm used is an on-line algorithm: every example is used by the algorithm only once, and is then discarded. This has significance in terms of efficiency, as well as quick adaptation to new contexts.We present an extensive experimental study of our algorithm under various conditions; in particular, it is shown that the algorithm performs comparably to the best known algorithms for POS.